A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK

Last update: Dec 28, 2022

Related tags

Deep Learning Pytorch-MBNet

Overview

Pytorch-MBNet

A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK

Training

To train a new model, please run train.py, the input arguments are:

--data_path: The path of the directory containing all .wav files of VCC-2018 and the train/dev/test split files (the files in ./data).
--save_dir: The path of the directory to save the trained models. Please create the directory before training.
--total_steps: The total #training step in the training.
--valid_steps: Do the validation every #(valid_steps) of training update.
--log_steps: Log the tensorboard every #(log_steps) of training update.
--update_freq: Gradient accumulation, the default value is 1 (no accumulation).

Testing

To test on VCC-2018, please run test.py, the input arguments are:

--model_path: The path to the saved model.
--idtable_path: The path to the "judge id-number" mapping table file used during training.
--step: The time step for tensorboard log, which can be the same as the training steps.
--split: The valid/test split of data to be used in the testing.

Inference

After training on the VCC data, the model can be utilized to inference on other data. The input arguments are --data_path, --model_path, --save_dir, which are similar to the above. Notice that the bias-net is not used since in this code the ground-truth judge ids are assumed to be unavailable.

The pre-trained model can be found in ./pre_trained.

A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK

Related tags

Overview

Pytorch-MBNet

Training

Testing

Inference

Owner

The open-source and free to use Python package miseval was developed to establish a standardized medical image segmentation evaluation procedure

This repository lets you interact with Lean through a REPL.

TyXe: Pyro-based BNNs for Pytorch users

QKeras: a quantization deep learning library for Tensorflow Keras

This repo includes our code for evaluating and improving transferability in domain generalization (NeurIPS 2021)

Part-Aware Data Augmentation for 3D Object Detection in Point Cloud

A way to store images in YAML.

Dimension Reduced Turbulent Flow Data From Deep Vector Quantizers

Music Generation using Neural Networks Streamlit App

Find the Heart simple Python Game

A PyTorch implementation of "Predict then Propagate: Graph Neural Networks meet Personalized PageRank" (ICLR 2019).

Reference code for the paper "Cross-Camera Convolutional Color Constancy" (ICCV 2021)

Implementations for the ICLR-2021 paper: SEED: Self-supervised Distillation For Visual Representation.

Data and Code for paper Outlining and Filling: Hierarchical Query Graph Generation for Answering Complex Questions over Knowledge Graph is available for research purposes.

Pytorch implementation of DeePSiM

Official Implementation of "LUNAR: Unifying Local Outlier Detection Methods via Graph Neural Networks"

The codes and related files to reproduce the results for Image Similarity Challenge Track 2.

SEC'21: Sparse Bitmap Compression for Memory-Efficient Training onthe Edge

Hysterese plugin with two temperature offset areas

Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.