Jax/Flax implementation of Variational-DiffWave.

Last update: Dec 16, 2022

Overview

jax-variational-diffwave

Jax/Flax implementation of Variational-DiffWave (Zhifeng Kong et al., 2020; Diederik P. Kingma et al., 2021).

DiffWave with Continuous-time Variational Diffusion Models.
DiffWave: A Versatile Diffusion Model for Audio Synthesis, Zhifeng Kong et al., 2020. [arXiv:2009.09761]
Variational Diffusion Models, Diederik P. Kingma et al., 2021. [arXiv:2107.00630]
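For readers unfamiliar with the continuous-time formulation, the following is a minimal sketch of the forward (noising) process from Variational Diffusion Models, written in JAX. It is not taken from this repository: the function names `gamma_linear` and `diffuse` are hypothetical, and the fixed linear schedule with its endpoint values is only an illustrative assumption (the paper also describes a learned schedule). It relies on the paper's variance-preserving parameterization, where gamma(t) = -log SNR(t), alpha_t^2 = sigmoid(-gamma(t)) and sigma_t^2 = sigmoid(gamma(t)).

```python
import jax
import jax.numpy as jnp


def gamma_linear(t, gamma_min=-13.3, gamma_max=5.0):
    """Hypothetical fixed linear schedule: gamma(t) = -log SNR(t), t in [0, 1].

    The endpoint values are illustrative assumptions, not repository settings.
    """
    return gamma_min + (gamma_max - gamma_min) * t


def diffuse(key, x, t):
    """Sample z_t ~ q(z_t | x) = N(alpha_t * x, sigma_t^2 * I)."""
    gamma = gamma_linear(t)
    # Variance-preserving parameterization: alpha^2 + sigma^2 = 1.
    alpha = jnp.sqrt(jax.nn.sigmoid(-gamma))
    sigma = jnp.sqrt(jax.nn.sigmoid(gamma))
    eps = jax.random.normal(key, x.shape)
    # Broadcast the per-example scalars [B] over the time axis of x [B, T].
    z = alpha[:, None] * x + sigma[:, None] * eps
    return z, eps


key = jax.random.PRNGKey(0)
x = jnp.zeros((2, 16000))   # placeholder audio batch [B, T]
t = jnp.array([0.1, 0.9])   # one continuous timestep per example
z, eps = diffuse(key, x, t)
```

A denoising network would then be trained to predict `eps` from `z`, the timestep, and the mel-spectrogram condition; that part is omitted here.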

Requirements

Tested in a Python 3.7.9 conda environment; dependencies are listed in requirements.txt.

Usage

To train the model, run train.py.
Checkpoints are written to TrainConfig.ckpt and TensorBoard summaries to TrainConfig.log.

python train.py --data-dir /datasets/ljspeech --from-raw
tensorboard --logdir ./log/

To resume training from a previous checkpoint, use --load-epoch.

python train.py --load-epoch 10 --config ./ckpt/l1.json

[WIP] To synthesize the test set, run synth.py.

python synth.py

[WIP] Pretrained checkpoints are published on the releases page.

To use a pretrained model, download the files and unzip them.
Check out the git repository at the matching commit tag; the following is a sample script.

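import json

import jax

# Config and VLBDiffWaveApp are defined in this repository;
# import them from the checked-out source before running.
with open('l1.json') as f:
    config = Config.load(json.load(f))

diffwave = VLBDiffWaveApp(config.model)
diffwave.restore('./l1/l1_99.ckpt')

# mel: [B, T, mel]
audio, _ = diffwave(mel, timesteps=50, key=jax.random.PRNGKey(0))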
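To listen to the result, the returned audio can be written to WAV files. The sketch below is an assumption-laden example, not part of this repository: it assumes `audio` is a [B, T] array of floats in [-1, 1] and uses 22050 Hz, the LJSpeech sample rate, as the default. The helper name `write_wav` is hypothetical, and a zero-filled placeholder stands in for the model output so the snippet runs on its own.

```python
import wave

import numpy as np


def write_wav(path, audio, sample_rate=22050):
    """Write a mono float waveform, assumed to lie in [-1, 1], as 16-bit PCM."""
    audio = np.clip(np.asarray(audio, dtype=np.float32), -1.0, 1.0)
    pcm = (audio * 32767.0).astype('<i2')  # little-endian int16 samples
    with wave.open(path, 'wb') as f:
        f.setnchannels(1)
        f.setsampwidth(2)  # 2 bytes per sample
        f.setframerate(sample_rate)
        f.writeframes(pcm.tobytes())


# Placeholder standing in for the `audio` returned by the model, assumed [B, T].
audio = np.zeros((2, 22050), dtype=np.float32)
for i, wav in enumerate(audio):
    write_wav(f'sample_{i}.wav', wav)
```

`np.asarray` also accepts a JAX array, so the model output can be passed in directly if the shape and range assumptions hold.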

Owner

YoungJoong Kim
