A Pytorch Implementation of ClariNet

Last update: Sep 15, 2022

Overview

ClariNet

A Pytorch Implementation of ClariNet (Mel Spectrogram --> Waveform)

Requirements

PyTorch 0.4.1 & python 3.6 & Librosa

Examples

Step 1. Download Dataset

LJSpeech : https://keithito.com/LJ-Speech-Dataset/

Step 2. Preprocessing (Preparing Mel Spectrogram)

python preprocessing.py --in_dir ljspeech --out_dir DATASETS/ljspeech

Step 3. Train Gaussian Autoregressive WaveNet (Teacher)

python train.py --model_name wavenet_gaussian --batch_size 8 --num_blocks 2 --num_layers 10

Step 4. Synthesize (Teacher)

--load_step CHECKPOINT : the # of the pre-trained teacher model's global training step (also depicted in the trained weight file)

python synthesize.py --model_name wavenet_gaussian --num_blocks 2 --num_layers 10 --load_step 10000 --num_samples 5

Step 5. Train Gaussian Inverse Autoregressive Flow (Student)

--teacher_name (YOUR TEACHER MODEL'S NAME)

--teacher_load_step CHECKPOINT : the # of the pre-trained teacher model's global training step (also depicted in the trained weight file)

--KL_type qp : Reversed KL divegence KL(q||p) or --KL_type pq : Forward KL divergence KL(p||q)

python train_student.py --model_name wavenet_gaussian_student --teacher_name wavenet_gaussian --teacher_load_step 10000 --batch_size 2 --num_blocks_t 2 --num_layers_t 10 --num_layers_s 10 --KL_type qp

Step 6. Synthesize (Student)

--model_name (YOUR STUDENT MODEL'S NAME)

--load_step CHECKPOINT : the # of the pre-trained student model's global training step (also depicted in the trained weight file)

--teacher_name (YOUR TEACHER MODEL'S NAME)

--teacher_load_step CHECKPOINT : the # of the pre-trained teacher model's global training step (also depicted in the trained weight file)

python synthesize_student.py --model_name wavenet_gaussian_student --load_step 10000 --teacher_name wavenet_gaussian --teacher_load_step 10000 --num_blocks_t 2 --num_layers_t 10 --num_layers_s 10 --num_samples 5

References

WaveNet vocoder : https://github.com/r9y9/wavenet_vocoder
ClariNet : https://arxiv.org/abs/1807.07281

A Pytorch Implementation of ClariNet

Related tags

Overview

ClariNet

Requirements

Examples

Step 1. Download Dataset

Step 2. Preprocessing (Preparing Mel Spectrogram)

Step 3. Train Gaussian Autoregressive WaveNet (Teacher)

Step 4. Synthesize (Teacher)

Step 5. Train Gaussian Inverse Autoregressive Flow (Student)

Step 6. Synthesize (Student)

References

Owner

Sungwon Kim

Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020)

A TikTok-like recommender system for GitHub repositories based on Gorse

Ascend your Jupyter Notebook usage

An implementation of Equivariant e2 convolutional kernals into a convolutional self attention network, applied to radio astronomy data.

Text Summarization - WCN — Weighted Contextual N-gram method for evaluation of Text Summarization

Official implementation of the Neurips 2021 paper Searching Parameterized AP Loss for Object Detection.

ML-PersonalWork - Big assignment PersonalWork in Machine Learning, 2021 autumn BUAA.

PFLD pytorch Implementation

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives

Discord-Protect is a simple discord bot allowing you to have some security on your discord server by ordering a captcha to the user who joins your server.

generate-2D-quadrilateral-mesh-with-neural-networks-and-tree-search

Code for approximate graph reduction techniques for cardinality-based DSFM, from paper

Flexible-Modal Face Anti-Spoofing: A Benchmark

A simple Python configuration file operator.

百度2021年语言与智能技术竞赛机器阅读理解Pytorch版baseline

Efficient and intelligent interactive segmentation annotation software

An Open-Source Tool for Automatic Disease Diagnosis..

Deep Distributed Control of Port-Hamiltonian Systems

In this project I played with mlflow, streamlit and fastapi to create a training and prediction app on digits

When BERT Plays the Lottery, All Tickets Are Winning