This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

Last update: Dec 05, 2022

Overview

Non-autoregressive Deep Learning-Based TTS Template

This is a template for the Non-autoregressive TTS model. It contains

Data Preprocessing Pipeline
Data Loader
Model / Trainer
Logger, Postprocessing (logging, synthesizing, plotting, etc..)

How to use it?

Clone the repository.

git clone https://github.com/keonlee9420/Deep-Learning-TTS-Template
cd Deep-Learning-TTS-Template

Replace all MYMODEL strings in this repo with your model name and also rename the file model/MYMODEL.py.
Build your model on model/ and check train.py and synthesize.py.
Use README_template.md for the README.md file of your project.
Feel free to add /img for your model architecture and tensorboard examples. It would also be nice to show your model's output audio in /demo.
Don't forget to update requirements.txt and /config of your project.

Citation

@misc{lee2021deep_learning_tts_template,
  author = {Lee, Keon},
  title = {Deep-Learning-TTS-Template},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/keonlee9420/Deep-Learning-TTS-Template}}
}

References

ming024's FastSpeech2

You might also like...

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

DiffSinger - PyTorch Implementation PyTorch implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension). Status

152 Jan 2, 2023

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

GradTTS Unofficial Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech" (arxiv) About this repo This is an unoffic

103 Dec 23, 2022

A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

Fast Symbolic Regression Symbolic Regression is a non-linear, non-parametric Machine Learning method capable of modeling complex data sets. fastsr aim

3 Jun 22, 2022

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

Confluence: A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection 1. 介绍用以替代 NMS，在所有 bbox 中挑选出最优的集合。 NMS 仅考虑了 bbox 的得分，然后根据 IOU 来

44 Sep 15, 2022

This project uses Template Matching technique for object detecting by detection of template image over base image.

Object Detection Project Using OpenCV This project uses Template Matching technique for object detecting by detection the template image over base ima

7 May 29, 2022

This project uses Template Matching technique for object detecting by detection of template image over base image

Object Detection Project Using OpenCV This project uses Template Matching technique for object detecting by detection the template image over base ima

4 Nov 16, 2021

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

English | 简体中文 Why Non-Euclidean Geometry Considering these simple graph structures shown below. Nodes with same color has 2-hop distance whereas 1-ho

123 Dec 12, 2022

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

53 Dec 29, 2022

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

One model to speak them all 🌎 Audio Language Text ▷ Chinese 人人生而自由，在尊严和权利上一律平等。 ▷ English All human beings are born free and equal in dignity and rig

60 Nov 14, 2022

This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

Related tags

Overview

Non-autoregressive Deep Learning-Based TTS Template

How to use it?

Citation

References

You might also like...

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

This project uses Template Matching technique for object detecting by detection of template image over base image.

This project uses Template Matching technique for object detecting by detection of template image over base image

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Releases(v1.0.0)

v1.0.0(Jun 15, 2021)

Owner

Keon Lee

cisip-FIRe - Fast Image Retrieval

The openspoor package is intended to allow easy transformation between different geographical and topological systems commonly used in Dutch Railway

Measures input lag without dedicated hardware, performing motion detection on recorded or live video

Official code for the paper: Deep Graph Matching under Quadratic Constraint (CVPR 2021)

My personal Home Assistant configuration.

SpanNER: Named EntityRe-/Recognition as Span Prediction

Convert scikit-learn models to PyTorch modules

Face-Recognition-based-Attendance-System - An implementation of Attendance System in python.

SOTA model in CIFAR10

Use unsupervised and supervised learning to predict stocks

Keras implementations of Generative Adversarial Networks.

Py-faster-rcnn - Faster R-CNN (Python implementation)

ncnn is a high-performance neural network inference framework optimized for the mobile platform

GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models

Weakly Supervised Scene Text Detection using Deep Reinforcement Learning

Graph Posterior Network: Bayesian Predictive Uncertainty for Node Classification (NeurIPS 2021)

How to Leverage Multimodal EHR Data for Better Medical Predictions?

OpenMMLab Computer Vision Foundation

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Code for "3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop"