《Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis》(2021)

Last update: Nov 27, 2022

Related tags

Overview

Image2Reverb

Image2Reverb is an end-to-end neural network that generates plausible audio impulse responses from single images of acoustic environments. Code for the paper Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis. The architecture is a conditional GAN with a ResNet50 (pre-trained on Places365 and fine-tuned) image encoder. It generates monoaural audio impulse responses (directly applicable to convolution applications) as magnitude spectrograms.

Dependencies

Model/Data:

PyTorch>=1.7.0
PyTorch Lightning
torchvision
torchaudio
librosa
PyRoomAcoustics
PIL

Eval/Preprocessing:

PySoundfile
SciPy
Scikit-Learn
python-acoustics
google-images-download
matplotlib

Usage

We will make a pre-trained model available soon!

Acknowledgments

We borrow and adapt code snippets from GANSynth (and this PyTorch re-implementation), additional snippets from this PGGAN implementation, and more.

Owner

Nikhil Singh

GitHub Repository

A pytorch implementation of Pytorch-Sketch-RNN

Pytorch-Sketch-RNN A pytorch implementation of https://arxiv.org/abs/1704.03477 In order to draw other things than cats, you will find more drawing da

172 Dec 12, 2022

PyTorch for Semantic Segmentation

PyTorch for Semantic Segmentation This repository contains some models for semantic segmentation and the pipeline of training and testing models, impl

1.7k Jan 06, 2023

An algorithm study of the 6th iOS 10 set of Boost Camp Web Mobile

알고리즘 스터디 🔥 부스트캠프 웹모바일 6기 iOS 10조의 알고리즘 스터디 입니다. 개인적인 사정 등으로 S034, S055만 참가하였습니다. 스터디 목적 상진: 코테 합격 + 부캠끝나고 아침에 일어나기 위해 필요한 사이클 기완: 꾸준하게 자리에 앉아 공부하기 +

2 Jan 11, 2022

SOTR: Segmenting Objects with Transformers [ICCV 2021]

SOTR: Segmenting Objects with Transformers [ICCV 2021] By Ruohao Guo, Dantong Niu, Liao Qu, Zhenbo Li Introduction This is the official implementation

186 Dec 20, 2022

Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)

Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021) Hang Zhou, Yasheng Sun, Wayne Wu, Chen Cha

628 Dec 28, 2022

Boundary-aware Transformers for Skin Lesion Segmentation

Boundary-aware Transformers for Skin Lesion Segmentation Introduction This is an official release of the paper Boundary-aware Transformers for Skin Le

79 Dec 16, 2022

Unpaired Caricature Generation with Multiple Exaggerations

CariMe-pytorch The official pytorch implementation of the paper "CariMe: Unpaired Caricature Generation with Multiple Exaggerations" CariMe: Unpaired

37 Dec 30, 2022

MPI Interest Group on Algorithms on 1st semester 2021

MPI Algorithms Interest Group Introduction Lecturer: Steve Yan Location: TBA Time Schedule: TBA Semester: 1 Useful URLs Typora: https://typora.io Goog

13 Sep 08, 2022

Predicting Event Memorability from Contextual Visual Semantics

0 Oct 06, 2021

Motion planning algorithms commonly used on autonomous vehicles. (path planning + path tracking)

Overview This repository implemented some common motion planners used on autonomous vehicles, including Hybrid A* Planner Frenet Optimal Trajectory Hi

1k Jan 09, 2023

The PyTorch re-implement of a 3D CNN Tracker to extract coronary artery centerlines with state-of-the-art (SOTA) performance. (paper: 'Coronary artery centerline extraction in cardiac CT angiography using a CNN-based orientation classiﬁer')

The PyTorch re-implement of a 3D CNN Tracker to extract coronary artery centerlines with state-of-the-art (SOTA) performance. (paper: 'Coronary artery centerline extraction in cardiac CT angiography

135 Dec 23, 2022

《Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis》(2021)

Related tags

Overview

Image2Reverb

Dependencies

Usage

Acknowledgments

Owner

Nikhil Singh

A pytorch implementation of Pytorch-Sketch-RNN

PyTorch for Semantic Segmentation

An algorithm study of the 6th iOS 10 set of Boost Camp Web Mobile

SOTR: Segmenting Objects with Transformers [ICCV 2021]

Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)

Boundary-aware Transformers for Skin Lesion Segmentation

Unpaired Caricature Generation with Multiple Exaggerations

MPI Interest Group on Algorithms on 1st semester 2021

Predicting Event Memorability from Contextual Visual Semantics

Motion planning algorithms commonly used on autonomous vehicles. (path planning + path tracking)

The PyTorch re-implement of a 3D CNN Tracker to extract coronary artery centerlines with state-of-the-art (SOTA) performance. (paper: 'Coronary artery centerline extraction in cardiac CT angiography using a CNN-based orientation classiﬁer')

以孤立语假设和宽度优先搜索为基础，构建了一种多通道堆叠注意力Transformer结构的斗地主ai

Material for my PyConDE & PyData Berlin 2022 Talk "5 Steps to Speed Up Your Data-Analysis on a Single Core"

A pytorch-based real-time segmentation model for autonomous driving

A PoC Corporation Relationship Knowledge Graph System on top of Nebula Graph.

Implementation of "Bidirectional Projection Network for Cross Dimension Scene Understanding" CVPR 2021 (Oral)

Your interactive network visualizing dashboard

This is the source code for our ICLR2021 paper: Adaptive Universal Generalized PageRank Graph Neural Network.

PyTorch code for Composing Partial Differential Equations with Physics-Aware Neural Networks

Prototype for Baby Action Detection and Classification