User-friendly Voice Cloning Application

Last update: Dec 30, 2022

Overview

Multi-Language-RTVC (Multi-Language Real-Time Voice Cloning) is a voice cloning tool that extracts speaker-specific audio features from just a few seconds of previously unseen audio and uses them to synthesize speech in that voice.

License

This code is licensed under the MIT License. For more information on the license terms and the associated rights and duties, see the license text in the project repository.

Project History

This project was started in 2021 with the goal of building on Corentin Jemine's Real-Time-Voice-Cloning. It originated from the wish for multi-language support in voice cloning models and is now maintained and enhanced by volunteer contributors.

Contributing

We welcome everyone interested in the project, from beginners to experts. The MLRTVC community aims for a friendly, open-minded and efficient working climate. We encourage anyone with ideas to take part in the project by sharing their thoughts.
There are multiple meaningful ways of contributing:

Developing code (new features, fixes, enhancements)
Writing documentation
Raising issues (bugs, feature requests, enhancement proposals, code refactoring, etc.)
Providing pre-trained models
Participating in community tasks (code reviews, discussions, maintenance, etc.)

For transparency reasons, we ask you to engage with this project through the official channels (issues, pull requests) so that knowledge and questions are shared publicly. Other communication channels (email, chat, etc.) are accepted only in cases where privacy or confidentiality is of great importance.

Further information can be found in the Contributing Guidelines.

Owner

Sven Eschlbeck
