EfficientTTS

Unofficial Pytorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"(arXiv).

Disclaimer: Somebody mistakenly think I'm one of the authors. In fact, I am not even in the author list of this paper. I am just a TTS enthusiast. Some important information of the implementation is not presented by the paper. Some model parameters in current version is based on my understanding and exepriments, which may not be consistent with those used by the authors.

Updates

2020/12/23: Mandarin Chinese Samples uploaded. The experiment setting is exactly the same with the LJSpeech example. A complete description of the usage will be soon uploaded.

2020/12/20: Using the HifiGAN finetuned with Tacotron2 GTA mel spectrograms can increase the quality of the generated samples, please see the newly generated-samples

Current status

Implementation of EFTS-CNN + HifiGAN

Setup with virtualenv

$ cd tools
$ make
# If you want to use distributed training, please run following
# command to install apex.
$ make apex

Note: If you want to specify Python version, CUDA version or PyTorch version, please run for example:

$ make PYTHON=3.7 CUDA_VERSION=10.1 PYTORCH_VERSION=1.6

Training

Please go to egs/lj folder, and see run.sh for example use.

Acknowledgement

The code framework is from https://github.com/kan-bayashi/ParallelWaveGAN

Pytorch implementation of

Related tags

Overview

EfficientTTS

Unofficial Pytorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"(arXiv).

Updates

Current status

Setup with virtualenv

Training

Acknowledgement

Owner

Liu Songxiang

Tutorial for the PERFECTING FACTORY 5.0 WITH EDGE-POWERED AI workshop

Multi-Objective Loss Balancing for Physics-Informed Deep Learning

📚 A collection of Jupyter notebooks for learning and experimenting with OpenVINO 👓

PyTorch implementation of 'Gen-LaneNet: a generalized and scalable approach for 3D lane detection'

Point cloud processing tool library.

Short and long time series classification using convolutional neural networks

DGL-TreeSearch and the Gurobi-MWIS interface

FAVD: Featherweight Assisted Vulnerability Discovery

Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation"

Large dataset storage format for Pytorch

CVPR 2021 Challenge on Super-Resolution Space

A time series processing library

Functional deep learning

10x faster matrix and vector operations

pytorch implementation for Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network arXiv:1609.04802

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

Self-Supervised Learning for Domain Adaptation on Point-Clouds

A Dataset for Direct Quotation Extraction and Attribution in News Articles.

Hyperopt for solving CIFAR-100 with a convolutional neural network (CNN) built with Keras and TensorFlow, GPU backend

Source code for "OmniPhotos: Casual 360° VR Photography"