Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Grad-TTS

Official implementation of the Grad-TTS model based on Diffusion Probabilistic Modelling. For all details check out our paper accepted to ICML 2021 via this link.

Authors: Vadim Popov*, Ivan Vovk*, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov.

^{*Equal contribution.}

SPIRAL

Official implementation of SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training. For all details check out our paper accepted to ICLR 2022 via this link.

Authors: Wenyong Huang, Zhenhe Zhang, Yu Ting Yeung, Xin Jiang, Qun Liu.

DiffVC

Official implementation of the paper "Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme" (ICLR 2022, Oral). Link.

Authors: Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov, Jiansheng Wei.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
DiffVC		DiffVC
Grad-TTS		Grad-TTS
SPIRAL		SPIRAL
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DiffVC

DiffVC

Grad-TTS

Grad-TTS

SPIRAL

SPIRAL

README.md

README.md

Repository files navigation

Speech-Backbones

Grad-TTS

SPIRAL

DiffVC

About

Releases

Packages

Contributors 4

Languages

huawei-noah/Speech-Backbones

Folders and files

Latest commit

History

Repository files navigation

Speech-Backbones

Grad-TTS

SPIRAL

DiffVC

About

Topics

Resources

Stars

Watchers

Forks

Languages