HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Last update: Dec 29, 2022

Related tags

Overview

HiFi++ : a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

This is the unofficial implementation of Vocoder part of HiFi++ : a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement.

Currently, this repo is WIP but you can start your training without any error.

Training:

python train.py --config config_v2.json

Citations:

@misc{https://doi.org/10.48550/arxiv.2203.13086,
  doi = {10.48550/ARXIV.2203.13086},
  
  url = {https://arxiv.org/abs/2203.13086},
  
  author = {Andreev, Pavel and Alanov, Aibek and Ivanov, Oleg and Vetrov, Dmitry},
  
  keywords = {Sound (cs.SD), Machine Learning (cs.LG), Audio and Speech Processing (eess.AS), FOS: Computer and information sciences, FOS: Computer and information sciences, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering},
  
  title = {HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement},
  
  publisher = {arXiv},
  
  year = {2022},
  
  copyright = {arXiv.org perpetual, non-exclusive license}
}

References:

https://github.com/jik876/hifi-gan

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Related tags

Overview

HiFi++ : a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Training:

Citations:

References:

Owner

Rishikesh (ऋषिकेश)

[NeurIPS 2021] Source code for the paper "Qu-ANTI-zation: Exploiting Neural Network Quantization for Achieving Adversarial Outcomes"

pyspark🍒🥭 is delicious，just eat it!😋😋

[CVPR 2022] PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision (Oral)

HODEmu, is both an executable and a python library that is based on Ragagnin 2021 in prep.

PyTorch implementation of our ICCV 2019 paper: Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis

[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds

Powerful and efficient Computer Vision Annotation Tool (CVAT)

We will release the code of "ConTNet: Why not use convolution and transformer at the same time?" in this repo

The pytorch implementation of the paper "text-guided neural image inpainting" at MM'2020

The official implementation of the IEEE S&P`22 paper "SoK: How Robust is Deep Neural Network Image Classification Watermarking".

3D Human Pose Machines with Self-supervised Learning

DEMix Layers for Modular Language Modeling

Implementation of the pix2pix model on satellite images

Semantic Bottleneck Scene Generation

Facial Expression Detection In The Realtime

Uses Open AI Gym environment to create autonomous cryptocurrency bot to trade cryptocurrencies.

Naszilla is a Python library for neural architecture search (NAS)

Large scale and asynchronous Hyperparameter Optimization at your fingertip.

NAS-Bench-x11 and the Power of Learning Curves

Anatomy of Matplotlib -- tutorial developed for the SciPy conference