Unofficial PyTorch Implementation of Multi-Singer

Last update: Dec 28, 2022

Overview

Multi-Singer

Unofficial PyTorch Implementation of Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus.

Requirements

See requirements in requirement.txt:

linux
python 3.6
pytorch 1.0+
librosa
json, tqdm, logging

TODO

1026: upload code
1024: implement multi-singer & perceptual loss
1023: implement singer encoder

Getting started

Apply recipe to your own dataset

Put your wav files in the data directory (the commands below use data/wavs)
Edit the configuration in config/config.yaml
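
Before preprocessing, it can help to confirm that the audio folder actually contains wav files. The following is a minimal sketch, not part of this repository: the helper name find_wavs is hypothetical, and it assumes the files use a .wav extension (in any letter case) and may sit in subfolders.

```python
from pathlib import Path


def find_wavs(data_dir):
    """Return all .wav files under data_dir (searched recursively), sorted by path."""
    root = Path(data_dir)
    if not root.is_dir():
        raise FileNotFoundError(f"data directory not found: {root}")
    # Compare the extension case-insensitively so 'a.WAV' is also picked up.
    return sorted(
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() == ".wav"
    )
```

For example, len(find_wavs("data/wavs")) gives the number of files that would be passed to the preprocessing step.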

1. Pretrain

Pretrain the Singer Embedding Extractor using the repository here, then set 'enc_model_fpath' in config/config.yaml to the pretrained model.

Note: Set the parameters to match those in 'encoder/params_data' and 'encoder/params_model'.

2. Preprocess

Extract mel-spectrogram

python preprocess.py -i data/wavs -o data/feature -c config/config.yaml

-i your audio folder

-o output acoustic feature folder

-c config file

3. Train

Training conditioned on mel-spectrogram

python train.py -i data/feature -o checkpoints/ --config config/config.yaml

-i acoustic feature folder

-o directory to save checkpoints

--config config file

4. Inference

python inference.py -i data/feature -o outputs/ -c checkpoints/*.pkl -g config/config.yaml

-i acoustic feature folder

-o directory to save generated audio

-c checkpoint file

-g config file
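
The preprocess, train and inference commands above can be chained from a small driver script. The sketch below is an illustration under stated assumptions, not part of this repository: the helper names are hypothetical, the flags are copied from the commands shown above, and choosing the most recently modified .pkl file as the checkpoint is an assumption, since the example command passes a glob pattern that only a shell would expand.

```python
import subprocess
import sys
from pathlib import Path

CONFIG = "config/config.yaml"


def preprocess_cmd(wav_dir="data/wavs", feature_dir="data/feature",
                   config=CONFIG, python=sys.executable):
    """Argument list for the mel-spectrogram extraction step."""
    return [python, "preprocess.py", "-i", wav_dir, "-o", feature_dir, "-c", config]


def train_cmd(feature_dir="data/feature", ckpt_dir="checkpoints/",
              config=CONFIG, python=sys.executable):
    """Argument list for the training step."""
    return [python, "train.py", "-i", feature_dir, "-o", ckpt_dir, "--config", config]


def latest_checkpoint(ckpt_dir="checkpoints/"):
    """Assumption: use the most recently modified .pkl file in ckpt_dir."""
    candidates = sorted(Path(ckpt_dir).glob("*.pkl"), key=lambda p: p.stat().st_mtime)
    if not candidates:
        raise FileNotFoundError(f"no .pkl checkpoint found in {ckpt_dir}")
    return candidates[-1]


def inference_cmd(checkpoint, feature_dir="data/feature", out_dir="outputs/",
                  config=CONFIG, python=sys.executable):
    """Argument list for the inference step, using one explicit checkpoint file."""
    return [python, "inference.py", "-i", feature_dir, "-o", out_dir,
            "-c", str(checkpoint), "-g", config]


def run_pipeline():
    """Run the three steps in order; check=True stops at the first failing step."""
    subprocess.run(preprocess_cmd(), check=True)
    subprocess.run(train_cmd(), check=True)
    subprocess.run(inference_cmd(latest_checkpoint()), check=True)
```

Calling run_pipeline() from the repository root runs the whole recipe. Passing argument lists instead of a shell string avoids quoting problems with paths that contain spaces.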

5. Singing Voice Synthesis

For Singing Voice Synthesis:

Use a modified FastSpeech to synthesize the mel-spectrogram.
Feed the synthesized mel-spectrogram to Multi-Singer to synthesize the waveform.

Acknowledgements

Citation

Please cite this repository using the "Cite this repository" option in the About section (top right of the main page).

Question

Feel free to contact me at [email protected]

Owner

SunMail-hub