E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Last update: Dec 15, 2022

Overview

End-to-end Music Remastering System

This repository includes source code and pre-trained models of the work End-to-end Music Remastering System Using Self-supervised and Adversarial Training by Junghyun Koo, Seungryeol Paik, and Kyogu Lee.

We provide inference code of the proposed system, which targets to alter the mastering style of a song to desired reference track.

Pre-trained Models

Model	Number of Epochs Trained	Details
Music Effects Encoder	1000	Trained with MTG-Jamendo Dataset
Mastering Cloner	1000	Trained with the above pre-trained Music Effects Encoder and Projection Discriminator

Inference

To run the inference code,

Download pre-trained models above and place them under the folder named 'model_checkpoints' (default)
Prepare input and reference tracks under the folder named 'inference_samples' (default).
Target files should be organized as follow:

    "path_to_data_directory"/"song_name_#1"/input.wav
    "path_to_data_directory"/"song_name_#1"/reference.wav
    ...
    "path_to_data_directory"/"song_name_#n"/input.wav
    "path_to_data_directory"/"song_name_#n"/reference.wav

Run 'inference.py'

python inference.py \
    --ckpt_dir "path_to_checkpoint_directory" \
    --data_dir_test "path_to_directory_containing_inference_samples"

Outputs will be stored under the folder 'inference_samples' (default)

Note: The system accepts WAV files of stereo-channeled, 44.1kHZ, and 16-bit rate. Target files shold be named "input.wav" and "reference.wav".

Configurations of each sub-networks

A detailed configuration of each sub-networks can also be found at

Self_Supervised_Music_Remastering_System/configs.yaml

E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Related tags

Overview

End-to-end Music Remastering System

Pre-trained Models

Inference

Configurations of each sub-networks

Owner

Junghyun (Tony) Koo

Fog Simulation on Real LiDAR Point Clouds for 3D Object Detection in Adverse Weather

details on efforts to dump the Watermelon Games Paprium cart

Tooling for converting STAC metadata to ODC data model

SegNet including indices pooling for Semantic Segmentation with tensorflow and keras

Additional functionality for use with fastai’s medical imaging module

DeepStochlog Package For Python

Code for reproducing our analysis in the paper titled: Image Cropping on Twitter: Fairness Metrics, their Limitations, and the Importance of Representation, Design, and Agency

《A-CNN: Annularly Convolutional Neural Networks on Point Clouds》(2019)

95.47% on CIFAR10 with PyTorch

Visualizing lattice vibration information from phonon dispersion to atoms (For GPUMD)

PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

Research on Event Accumulator Settings for Event-Based SLAM

Hand Gesture Volume Control | Open CV | Computer Vision

Codes for CyGen, the novel generative modeling framework proposed in "On the Generative Utility of Cyclic Conditionals" (NeurIPS-21)

GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond

The Official PyTorch Implementation of "LSGM: Score-based Generative Modeling in Latent Space" (NeurIPS 2021)

Hashformers is a framework for hashtag segmentation with transformers.

Transfer Learning Shootout for PyTorch's model zoo (torchvision)

Source code for "OmniPhotos: Casual 360° VR Photography"

Code repository for Semantic Terrain Classification for Off-Road Autonomous Driving