Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis (2021)

Last update: Nov 27, 2022

Overview

Image2Reverb

Image2Reverb is an end-to-end neural network that generates plausible audio impulse responses from single images of acoustic environments. This repository contains the code for the paper "Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis". The architecture is a conditional GAN whose image encoder is a ResNet50, pre-trained on Places365 and then fine-tuned. It generates monaural audio impulse responses as magnitude spectrograms, and the resulting impulse responses are directly applicable to convolution applications.
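To make "applicable to convolution applications" concrete, here is a minimal sketch of convolution reverb in plain Python. It is not taken from the Image2Reverb code: the function names, the toy exponentially decaying impulse response, and the parameter values are all assumptions chosen for illustration. In practice the impulse response would come from the model, and a fast routine such as scipy.signal.fftconvolve would replace the direct loop.

```python
import math
import random


def make_toy_impulse_response(sample_rate=8000, rt60=0.3, length_s=0.5, seed=0):
    """Hypothetical stand-in for a generated impulse response.

    Builds decaying noise whose envelope drops by 60 dB (a factor of 1000
    in amplitude) after `rt60` seconds.
    """
    rng = random.Random(seed)
    n = int(sample_rate * length_s)
    decay = math.log(1000.0) / rt60
    ir = [rng.uniform(-1.0, 1.0) * math.exp(-decay * i / sample_rate)
          for i in range(n)]
    if ir:
        ir[0] = 1.0  # direct sound arrives first at full amplitude
    return ir


def convolve(signal, kernel):
    """Direct (time-domain) linear convolution of two sample lists."""
    out = [0.0] * (len(signal) + len(kernel) - 1)
    for i, s in enumerate(signal):
        if s == 0.0:
            continue  # silent samples contribute nothing
        for j, k in enumerate(kernel):
            out[i + j] += s * k
    return out


def apply_reverb(dry, impulse_response):
    """Return the 'wet' signal: the dry audio convolved with the room's IR."""
    return convolve(dry, impulse_response)


# Usage: a single click (unit impulse) played in the toy "room".
ir = make_toy_impulse_response()
dry = [1.0] + [0.0] * 99
wet = apply_reverb(dry, ir)
print(len(dry), len(ir), len(wet))
```

Convolving a unit impulse with the impulse response reproduces the response itself, which is a quick sanity check for any convolution routine.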

Dependencies

Model/Data:

PyTorch>=1.7.0
PyTorch Lightning
torchvision
torchaudio
librosa
PyRoomAcoustics
PIL

Eval/Preprocessing:

PySoundfile
SciPy
Scikit-Learn
python-acoustics
google-images-download
matplotlib
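As a convenience, the dependency lists above can be checked with a short script. The sketch below is not part of the repository. The import names it maps each package to (for example `soundfile` for PySoundfile, `acoustics` for python-acoustics, `sklearn` for Scikit-Learn) are assumptions based on how these packages are commonly imported, and it does not check versions such as PyTorch>=1.7.0.

```python
import importlib.util

# Package name as listed above -> assumed top-level import name.
MODEL_DEPS = {
    "PyTorch": "torch",
    "PyTorch Lightning": "pytorch_lightning",
    "torchvision": "torchvision",
    "torchaudio": "torchaudio",
    "librosa": "librosa",
    "PyRoomAcoustics": "pyroomacoustics",
    "PIL": "PIL",
}

EVAL_DEPS = {
    "PySoundfile": "soundfile",
    "SciPy": "scipy",
    "Scikit-Learn": "sklearn",
    "python-acoustics": "acoustics",
    "google-images-download": "google_images_download",
    "matplotlib": "matplotlib",
}


def find_missing(deps):
    """Return the listed names whose import module cannot be located."""
    return [name for name, module in deps.items()
            if importlib.util.find_spec(module) is None]


for group, deps in (("Model/Data", MODEL_DEPS), ("Eval/Preprocessing", EVAL_DEPS)):
    missing = find_missing(deps)
    print(group, "missing:", ", ".join(missing) if missing else "none")
```

The script only locates modules without importing them, so it runs quickly even when heavy packages such as PyTorch are installed.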

Usage

We will make a pre-trained model available soon!

Acknowledgments

We borrow and adapt code snippets from GANSynth (and this PyTorch re-implementation), additional snippets from this PGGAN implementation, and more.

Owner

Nikhil Singh

GitHub Repository

Local-Global Stratified Transformer for Efficient Video Recognition

DualFormer This repo is the implementation of our manuscript entitled "Local-Global Stratified Transformer for Efficient Video Recognition". Our model

19 Dec 07, 2022

Retinal Vessel Segmentation with Pixel-wise Adaptive Filters (ISBI 2022)

Official code of Retinal Vessel Segmentation with Pixel-wise Adaptive Filters and Consistency Training (ISBI 2022)

14 Oct 27, 2022

Reproduce partial features of DeePMD-kit using PyTorch.

DeePMD-kit on PyTorch To better understand DeePMD-kit, we implement some of its features using PyTorch and expose an interface consuming descriptors. Tec

8 Dec 17, 2022

T-LOAM: Truncated Least Squares Lidar-only Odometry and Mapping in Real-Time

T-LOAM: Truncated Least Squares Lidar-only Odometry and Mapping in Real-Time The first Lidar-only odometry framework with high performance based on tr

183 Dec 01, 2022

This repository contains code for the paper "Disentangling Label Distribution for Long-tailed Visual Recognition", published at CVPR 2021

Disentangling Label Distribution for Long-tailed Visual Recognition (CVPR 2021) Arxiv link Blog post This codebase is built on Causal Norm. Install co

85 Oct 18, 2022

Empower Sequence Labeling with Task-Aware Language Model

LM-LSTM-CRF Check Our New NER Toolkit 🚀 🚀 🚀 Inference: LightNER: inference w. models pre-trained / trained w. any following tools, efficiently. Tra

838 Jan 05, 2023

This is a model made out of Neural Network specifically a Convolutional Neural Network model

This is a neural network model, specifically a Convolutional Neural Network. It was built with a pre-built dataset from the tensorflow and keras packages. There are other alternativ

9 Oct 18, 2022

This is a JAX implementation of Neural Radiance Fields for learning purposes.

learn-nerf This is a JAX implementation of Neural Radiance Fields for learning purposes. I've been curious about NeRF and its follow-up work for a whi

62 Dec 20, 2022

Boosting Adversarial Attacks with Enhanced Momentum (BMVC 2021)

EMI-FGSM This repository contains code to reproduce results from the paper: Boosting Adversarial Attacks with Enhanced Momentum (BMVC 2021) Xiaosen Wa

10 Sep 26, 2022

Pytorch implementation of NEGEV method. Paper: "Negative Evidence Matters in Interpretable Histology Image Classification".

PyTorch 1.10.0 code for: Negative Evidence Matters in Interpretable Histology Image Classification (https://arxiv.org/abs/xxxx.xxxxx) Citation: @arti

4 Dec 01, 2022

A parallel framework for population-based multi-agent reinforcement learning.

DeepMind Alchemy task environment: a meta-reinforcement learning benchmark

DiSECt: Differentiable Simulator for Robotic Cutting

A Jinja extension (compatible with Flask and other frameworks) to compile and/or compress your assets.

A PyTorch Reimplementation of TecoGAN: Temporally Coherent GAN for Video Super-Resolution

ESL: Event-based Structured Light

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

Learning to Stylize Novel Views

Multimodal commodity image retrieval 多模态商品图像检索

Arch-Net: Model Distillation for Architecture Agnostic Model Deployment