A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"

Last update: Jun 04, 2022

Related tags

Deep Learning LASAFT-Net-v2

Overview

LASAFT-Net-v2

Listen, Attend and Separate by Attentively aggregating Frequency Transformation

Woosung Choi, Yeong-Seok Jeong, Jinsung Kim, Jaehwa Chung, Soonyoung Jung, and Joshua D. Reiss

Demonstration (under construction)

Experimental Results

Musdb 18

model	vocals	drums	bass	other	AVG
Meta-TasNet	6.40	5.91	5.58	4.19	5.52
AMSS-Net	6.78	5.92	5.10	4.51	5.58
LaSAFT-Net-v1	7.33	5.68	5.63	4.87	5.88
LASAFT-Net-v2	7.57	6.13	5.28	4.87	5.96

MDX Challenge (Leaderboard A)

model	model type	vocals	drums	bass	other	AVG
KUILAB-MDX-Net	dedicated (1 source/ 1 model)	8.901	7.173	7.232	5.636	7.236
LaSAFT-Net-v1 (light)	conditioned (4 sources/ 1 model)	7.275	5.935	5.823	4.557	5.897
LASAFT-Net-v2 (light)	conditioned (4 sources/ 1 model)	7.324	5.976	5.884	4.642	5.957

How to reproduce

1. Environment

Ubuntu 20.04
wandb for logging

You must create .env file by copying .env.sample to set environmental variables.

wandb_api_key=[Your Key] # "xxxxxxxxxxxxxxxxxxxxxxxx"
data_dir=[Your Path] # "/home/ielab/repos/musdbHQ"

about wandb_api_key
- we currently only support wandb for logging.
- for wandb_api_key, visit wandb, go to setting, and then copy your api key
about data_dir
- the absolute path where datasets are stored

2. Installation (cuda)

conda env create -f environment.yaml -n lasaftv2
conda activate lasaftv2
pip install -r requirements.txt

A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"

Related tags

Overview

LASAFT-Net-v2

Listen, Attend and Separate by Attentively aggregating Frequency Transformation

Experimental Results

How to reproduce

1. Environment

2. Installation (cuda)

Owner

Woosung Choi

Action Recognition for Self-Driving Cars

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

A visualisation tool for Deep Reinforcement Learning

Stable Neural ODE with Lyapunov-Stable Equilibrium Points for Defending Against Adversarial Attacks

DeepFaceLive - Live Deep Fake in python, Real-time face swap for PC streaming or video calls

Using BERT+Bi-LSTM+CRF

Sketch-Based 3D Exploration with Stacked Generative Adversarial Networks

PyTorch implementation of our CVPR2021 (oral) paper "Prototype Augmentation and Self-Supervision for Incremental Learning"

Training and Evaluation Code for Neural Volumes

Implements VQGAN+CLIP for image and video generation, and style transfers, based on text and image prompts. Emphasis on ease-of-use, documentation, and smooth video creation.

This project uses reinforcement learning on stock market and agent tries to learn trading. The goal is to check if the agent can learn to read tape. The project is dedicated to hero in life great Jesse Livermore.

Generative Query Network (GQN) in PyTorch as described in "Neural Scene Representation and Rendering"

TyXe: Pyro-based BNNs for Pytorch users

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)

Attention-based Transformation from Latent Features to Point Clouds (AAAI 2022)

Eff video representation - Efficient video representation through neural fields

50-days-of-Statistics-for-Data-Science - This repository consist of a 50-day program

MDMM - Learning multi-domain multi-modality I2I translation

A Robust Unsupervised Ensemble of Feature-Based Explanations using Restricted Boltzmann Machines

PyTorchVideo is a deeplearning library with a focus on video understanding work