Tacotron2-HiFiGAN-master

A text-to-speech (TTS) implementation that combines Tacotron2 and HiFi-GAN for Mandarin.

Inference

To run inference, download the pre-trained Tacotron2 model for Mandarin and place it in the root path. Then run infer_tacotron2_hifigan.py to synthesize speech. To change the input text, edit the variable text in infer_tacotron2_hifigan.py. The result is saved in the root path as output.wav.
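The internals of infer_tacotron2_hifigan.py are not shown here, so the following is only a minimal sketch of the last step of such a pipeline: writing a vocoder waveform to output.wav. It assumes the vocoder returns float samples in [-1, 1] and a 22050 Hz sample rate (typical for LJSpeech-style HiFi-GAN configs). The function name save_wav and the sine-wave stand-in are hypothetical and are not taken from the repo.

```python
import math
import struct
import wave

SAMPLE_RATE = 22050    # assumption: typical rate for LJSpeech-style HiFi-GAN configs
MAX_WAV_VALUE = 32767  # largest positive 16-bit PCM value


def save_wav(samples, path="output.wav", sample_rate=SAMPLE_RATE):
    """Clip float samples to [-1, 1], scale to int16, and write a mono WAV file.

    Hypothetical helper: the real script may save audio differently.
    Returns the number of samples written.
    """
    pcm = [int(max(-1.0, min(1.0, s)) * MAX_WAV_VALUE) for s in samples]
    with wave.open(path, "wb") as f:
        f.setnchannels(1)   # mono
        f.setsampwidth(2)   # 2 bytes per sample = 16-bit PCM
        f.setframerate(sample_rate)
        f.writeframes(struct.pack("<%dh" % len(pcm), *pcm))
    return len(pcm)


# Stand-in for vocoder output: one second of a 440 Hz tone.
demo_waveform = [
    0.5 * math.sin(2 * math.pi * 440 * n / SAMPLE_RATE)
    for n in range(SAMPLE_RATE)
]
```

Calling save_wav(demo_waveform) would write a one-second tone to output.wav in the current directory; in the real pipeline the waveform would come from HiFi-GAN applied to the mel spectrogram predicted by Tacotron2.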

The pre-trained HiFi-GAN model is placed in LJ_FT_T2_V3; it was trained on LJSpeech and fine-tuned with Tacotron2. You can find more pre-trained models of different sizes and parameters in the original HiFi-GAN repo. If you want to try a different model or train your own, remember to update the variables in infer_tacotron2_hifigan.py that set the path of the HiFi-GAN model.
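When swapping in a different HiFi-GAN model, it can help to check the new path before running inference. The sketch below assumes the layout used by the original HiFi-GAN inference script, which, as far as we know, reads config.json from the same directory as the generator checkpoint. The function name load_vocoder_config is hypothetical; the actual variable names in infer_tacotron2_hifigan.py may differ.

```python
import json
import os


def load_vocoder_config(checkpoint_path):
    """Return the parsed config.json that sits next to a HiFi-GAN checkpoint.

    Hypothetical helper. Assumes config.json lives in the same directory as
    the checkpoint file, as in the original HiFi-GAN repo layout.
    """
    if not os.path.isfile(checkpoint_path):
        raise FileNotFoundError("checkpoint not found: " + checkpoint_path)
    config_path = os.path.join(os.path.dirname(checkpoint_path), "config.json")
    if not os.path.isfile(config_path):
        raise FileNotFoundError("config.json not found next to checkpoint: " + config_path)
    with open(config_path, encoding="utf-8") as f:
        return json.load(f)
```

Failing early with a clear message is easier to debug than a missing-file error raised deep inside model loading.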

Audio Sample

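Input: 相对论直接和间接的催生了量子力学的诞生也为研究微观世界的高速运动确立了全新的数学模型 (English: "Relativity directly and indirectly gave rise to quantum mechanics, and also established a brand-new mathematical model for studying high-speed motion in the microscopic world.")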
Output: tacotron2-hifigan.wav

Owner

SunLu Z
