A Pytorch Implementation of ClariNet

Last update: Sep 15, 2022

Overview

ClariNet

A Pytorch Implementation of ClariNet (Mel Spectrogram --> Waveform)

Requirements

PyTorch 0.4.1 & python 3.6 & Librosa

Examples

Step 1. Download Dataset

LJSpeech : https://keithito.com/LJ-Speech-Dataset/

Step 2. Preprocessing (Preparing Mel Spectrogram)

python preprocessing.py --in_dir ljspeech --out_dir DATASETS/ljspeech

Step 3. Train Gaussian Autoregressive WaveNet (Teacher)

python train.py --model_name wavenet_gaussian --batch_size 8 --num_blocks 2 --num_layers 10

Step 4. Synthesize (Teacher)

--load_step CHECKPOINT : the # of the pre-trained teacher model's global training step (also depicted in the trained weight file)

python synthesize.py --model_name wavenet_gaussian --num_blocks 2 --num_layers 10 --load_step 10000 --num_samples 5

Step 5. Train Gaussian Inverse Autoregressive Flow (Student)

--teacher_name (YOUR TEACHER MODEL'S NAME)

--teacher_load_step CHECKPOINT : the # of the pre-trained teacher model's global training step (also depicted in the trained weight file)

--KL_type qp : Reversed KL divegence KL(q||p) or --KL_type pq : Forward KL divergence KL(p||q)

python train_student.py --model_name wavenet_gaussian_student --teacher_name wavenet_gaussian --teacher_load_step 10000 --batch_size 2 --num_blocks_t 2 --num_layers_t 10 --num_layers_s 10 --KL_type qp

Step 6. Synthesize (Student)

--model_name (YOUR STUDENT MODEL'S NAME)

--load_step CHECKPOINT : the # of the pre-trained student model's global training step (also depicted in the trained weight file)

--teacher_name (YOUR TEACHER MODEL'S NAME)

--teacher_load_step CHECKPOINT : the # of the pre-trained teacher model's global training step (also depicted in the trained weight file)

python synthesize_student.py --model_name wavenet_gaussian_student --load_step 10000 --teacher_name wavenet_gaussian --teacher_load_step 10000 --num_blocks_t 2 --num_layers_t 10 --num_layers_s 10 --num_samples 5

References

WaveNet vocoder : https://github.com/r9y9/wavenet_vocoder
ClariNet : https://arxiv.org/abs/1807.07281

A Pytorch Implementation of ClariNet

Related tags

Overview

ClariNet

Requirements

Examples

Step 1. Download Dataset

Step 2. Preprocessing (Preparing Mel Spectrogram)

Step 3. Train Gaussian Autoregressive WaveNet (Teacher)

Step 4. Synthesize (Teacher)

Step 5. Train Gaussian Inverse Autoregressive Flow (Student)

Step 6. Synthesize (Student)

References

Owner

Sungwon Kim

Repository for self-supervised landmark discovery

[ICCV 2021] Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation

Seasonal Contrast: Unsupervised Pre-Training from Uncurated Remote Sensing Data

A repo that contains all the mesh keys needed for mesh backend, along with a code example of how to use them in python

Official implementation of MSR-GCN (ICCV 2021 paper)

Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP

Super Pix Adv - Offical implemention of Robust Superpixel-Guided Attentional Adversarial Attack (CVPR2020)

pytorch implementation of ABC : Auxiliary Balanced Classifier for Class-imbalanced Semi-supervised Learning

Implementation of self-attention mechanisms for general purpose. Focused on computer vision modules. Ongoing repository.

[ECCV'20] Convolutional Occupancy Networks

Author Disambiguation using Knowledge Graph Embeddings with Literals

SGPT: Multi-billion parameter models for semantic search

sequitur is a library that lets you create and train an autoencoder for sequential data in just two lines of code

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning, NeurIPS 2021 (Spotlight)

9th place solution

Dirty Pixels: Towards End-to-End Image Processing and Perception

My published benchmark for a Kaggle Simulations Competition

Code for Multiple Instance Active Learning for Object Detection, CVPR 2021

Project of 'TBEFN: A Two-branch Exposure-fusion Network for Low-light Image Enhancement '

Meta-learning for NLP