HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Last update: Dec 27, 2022

HiFiGAN Denoiser

This is a Unofficial Pytorch implementation of the paper HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks.

Citations

@misc{su2020hifigan,
      title={HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks}, 
      author={Jiaqi Su and Zeyu Jin and Adam Finkelstein},
      year={2020},
      eprint={2006.05694},
      archivePrefix={arXiv},
      primaryClass={eess.AS}
}

Requirement

Tested on Python 3.6

pip install -r requirements.txt

Train & Tensorboard

python train.py -c [config yaml file]
tensorboard --logdir log_dir

Inference

python inference.py -p [checkpoint path] -i [input wav path]

Checkpoint :

References

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Denoising Wavenet Generator
StarGAN VC Discriminator
Melgan Multi-Scale Discriminator
Parallel Wavegan
HiFi GAN vocoder's MSD and multi-gpu training code

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Related tags

Overview

HiFiGAN Denoiser

Citations

Requirement

Train & Tensorboard

Inference

Checkpoint :

References

Owner

Rishikesh (ऋषिकेश)

🧮 Matrix Factorization for Collaborative Filtering is just Solving an Adjoint Latent Dirichlet Allocation Model after All

Deep Reinforcement Learning for Keras.

Implementation of Google Brain's WaveGrad high-fidelity vocoder

Proximal Backpropagation - a neural network training algorithm that takes implicit instead of explicit gradient steps

B-cos Networks: Attention is All we Need for Interpretability

Generative Exploration and Exploitation - This is an improved version of GENE.

An implementation of quantum convolutional neural network with MindQuantum. Huawei, classifying MNIST dataset

Lucid Sonic Dreams syncs GAN-generated visuals to music.

SOLO and SOLOv2 for instance segmentation, ECCV 2020 & NeurIPS 2020.

Official repository of DeMFI (arXiv.)

Implementations of paper Controlling Directions Orthogonal to a Classifier

WiFi-based Multi-task Sensing

A 1.3B text-to-image generation model trained on 14 million image-text pairs

Machine learning algorithms for many-body quantum systems

LyaNet: A Lyapunov Framework for Training Neural ODEs

Neural models of common sense. 🤖

An index of algorithms for learning causality with data

Improving Deep Network Debuggability via Sparse Decision Layers

Pre-trained models for a Cascaded-FCN in caffe and tensorflow that segments

Discord bot-CTFD-Thread-Parser - Discord bot CTFD-Thread-Parser