This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

Last update: Dec 05, 2022

Overview

Non-autoregressive Deep Learning-Based TTS Template

This is a template for the Non-autoregressive TTS model. It contains

Data Preprocessing Pipeline
Data Loader
Model / Trainer
Logger, Postprocessing (logging, synthesizing, plotting, etc..)

How to use it?

Clone the repository.

git clone https://github.com/keonlee9420/Deep-Learning-TTS-Template
cd Deep-Learning-TTS-Template

Replace all MYMODEL strings in this repo with your model name and also rename the file model/MYMODEL.py.
Build your model on model/ and check train.py and synthesize.py.
Use README_template.md for the README.md file of your project.
Feel free to add /img for your model architecture and tensorboard examples. It would also be nice to show your model's output audio in /demo.
Don't forget to update requirements.txt and /config of your project.

Citation

@misc{lee2021deep_learning_tts_template,
  author = {Lee, Keon},
  title = {Deep-Learning-TTS-Template},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/keonlee9420/Deep-Learning-TTS-Template}}
}

References

ming024's FastSpeech2

You might also like...

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

DiffSinger - PyTorch Implementation PyTorch implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension). Status

152 Jan 2, 2023

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

GradTTS Unofficial Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech" (arxiv) About this repo This is an unoffic

103 Dec 23, 2022

A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

Fast Symbolic Regression Symbolic Regression is a non-linear, non-parametric Machine Learning method capable of modeling complex data sets. fastsr aim

3 Jun 22, 2022

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

Confluence: A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection 1. 介绍用以替代 NMS，在所有 bbox 中挑选出最优的集合。 NMS 仅考虑了 bbox 的得分，然后根据 IOU 来

44 Sep 15, 2022

This project uses Template Matching technique for object detecting by detection of template image over base image.

Object Detection Project Using OpenCV This project uses Template Matching technique for object detecting by detection the template image over base ima

7 May 29, 2022

This project uses Template Matching technique for object detecting by detection of template image over base image

Object Detection Project Using OpenCV This project uses Template Matching technique for object detecting by detection the template image over base ima

4 Nov 16, 2021

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

English | 简体中文 Why Non-Euclidean Geometry Considering these simple graph structures shown below. Nodes with same color has 2-hop distance whereas 1-ho

123 Dec 12, 2022

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

53 Dec 29, 2022

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

One model to speak them all 🌎 Audio Language Text ▷ Chinese 人人生而自由，在尊严和权利上一律平等。 ▷ English All human beings are born free and equal in dignity and rig

60 Nov 14, 2022

This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

Related tags

Overview

Non-autoregressive Deep Learning-Based TTS Template

How to use it?

Citation

References

You might also like...

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

This project uses Template Matching technique for object detecting by detection of template image over base image.

This project uses Template Matching technique for object detecting by detection of template image over base image

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Releases(v1.0.0)

v1.0.0(Jun 15, 2021)

Owner

Keon Lee

PFFDTD is an open-source FDTD simulator for 3D room acoustics

10x faster matrix and vector operations

UT-Sarulab MOS prediction system using SSL models

A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

This repository introduces a short project about Transfer Learning for Classification of MRI Images.

Pytorch implementation of NeurIPS 2021 paper: Geometry Processing with Neural Fields.

Software for Multimodalty 2D+3D Facial Expression Recognition (FER) UI

SSD-based Object Detection in PyTorch

Implementation of Feedback Transformer in Pytorch

PyTorch implementation of the WarpedGANSpace: Finding non-linear RBF paths in GAN latent space (ICCV 2021)

Official git for "CTAB-GAN: Effective Table Data Synthesizing"

codes for Self-paced Deep Regression Forests with Consideration on Ranking Fairness

Code release for Convolutional Two-Stream Network Fusion for Video Action Recognition

CLIP + VQGAN / PixelDraw

[CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos

GARCH and Multivariate LSTM forecasting models for Bitcoin realized volatility with potential applications in crypto options trading, hedging, portfolio management, and risk management

MEDS: Enhancing Memory Error Detection for Large-Scale Applications

Code for the bachelors-thesis flaky fault localization

A Flow-based Generative Network for Speech Synthesis

The official codes for the ICCV2021 presentation "Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting"