User-friendly Voice Cloning Application

Last update: Dec 30, 2022

Overview

Multi-Language-RTVC stands for Multi-Language Real Time Voice Cloning and is a Voice Cloning Tool capable of transfering speaker-specific audio features to synthesize speeches in that voice based on just a few seconds of unknown audio data.

License

This code is licensed under MIT. For more information regarding the license model or associated duties and rights, click here.

Project History

This project was started in 2021 with the goal of inheriting Corentin Jemine's Real-Time-Voice-Cloning. The project originated from the wish of multi-language support for voice cloning models and is now maintained and enhanced by contributing volunteers.

Contributing

We welcome all those interested in the project, from beginners to experts. The MLRTVC community standard is a nice, open-minded and efficient working climate. We encourage all those with ideas to take part in the project by sharing their thoughts.
There are multiple meaningful ways of contributing:

Developing code (new features, fixes, enhancements)
Writing documentation
Raising issues (bugs, feature requests, enhancement proposals, code refacturing, etc.)
Providing pre-trained models
Participating in community tasks (code reviews, discussions, maintenance, etc.)

For transparacy reasons, we ask you to engage with this project via the official ways (issues, pull requests) to share knowledge and questions publicly. Only in cases where privacy or confidentiality is of great importance, other communication channels are accepted (email, chat, etc.).

Further information can be gained in the Contributing Guidelines.

User-friendly Voice Cloning Application

Related tags

Overview

License

Project History

Contributing

Owner

Sven Eschlbeck

Frescobaldi LilyPond Editor

A voice assistant which can be used to interact with your computer and controls your pc operations

A Youtube audio player for your terminal

Small Python application that links a Digico console and Reaper, handling automatic marker insertion and tracking.

Hide Your Secret Message in any Wave Audio File.

Noinoi music is smoothly playing music on voice chat of telegram.

MUSIC-AVQA, CVPR2022 (ORAL)

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

OpenClubhouse - A third-part web application based on flask to play Clubhouse audio.

commonfate 📦commonfate 📦 - Common Fate Model and Transform.

Code for paper 'Audio-Driven Emotional Video Portraits'.

live coding in python + supercollider

Gradient - A Python program designed to create a reactive and ambient music listening experience

Simple discord bot by @merive 🤖

Some utils for auto speech recognition

Python audio and music signal processing library

𝙰 𝙼𝚞𝚜𝚒𝚌 𝙱𝚘𝚝 𝙲𝚛𝚎𝚊𝚝𝚎𝚍 𝙱𝚢 𝚃𝚎𝚊𝚖𝙳𝚕𝚝 💖

Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch

🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.

Okaeri-Music is a telegram music bot project, allow you to play music on voice chat group telegram.