TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music

Last update: Dec 01, 2022

Overview

TONet

Introduction

The official implementation of "TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music", in ICASSP 2022

We propose TONet, a plug-and-play model that improves both tone and octave perceptions by leveraging a novel input representation and a novel network architecture. Any CFP-input-based Model can be settled in TONet and lead to possible better performance.

Main Results on Extraction Performance

Experiments are done to verify the capability of TONet with various baseline backbone models. Our results show that tone-octave fusion with Tone-CFP can significantly improve the singing voice extraction performance across various datasets -- with substantial gains in octave and tone accuracy.

Getting Started

Download Datasets

After downloading the data, use the txt files in the data folder, and process the CFP feature by feature_extraction.py.

Overwrite the Configuration

The config.py contains all configurations you need to change and set.

Train and Evaluation

python main.py train

python main.py test

Produce the Estimation Digram

Uncomment the write prediction in tonet.py

Model Checkpoints

We provide the best TO-FTANet checkpoints in this link. More checkpoints will be uploaded.

Citing

@inproceedings{tonet-ke2022,
  author = {Ke Chen, Shuai Yu, Cheng-i Wang, Wei Li, Taylor Berg-Kirkpatrick, Shlomo Dubnov},
  title = {TONet: Tone-Octave Network for Singing Melody Extraction  from Polyphonic Music},
  booktitle = {{ICASSP} 2022}
}

TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music

Related tags

Overview

TONet

Introduction

Main Results on Extraction Performance

Getting Started

Download Datasets

Overwrite the Configuration

Train and Evaluation

Produce the Estimation Digram

Model Checkpoints

Citing

Owner

Knut(Ke) Chen

Sound-Equalizer- This is a Sound Equalizer GUI App Using Python's PyQt5

This is an AI that runs in the terminal. It is a voice assistant that can do common activities and can also help in your coding doubts like

Spotifyd - An open source Spotify client running as a UNIX daemon.

All-In-One Digital Audio Workstation and Plugin Suite

Praat in Python, the Pythonic way

A simple python script to play bell sound in your system infinitely, just for fun and experimental purposes

ianZiPu is a way to write notation for Guqin (古琴) music.

The venturimeter works on the principle of Bernoulli's equation, i.e., the pressure decreases as the velocity increases.

A voice based calculator by using termux api in Android

Real-Time Spherical Microphone Renderer for binaural reproduction in Python

Implicit neural differentiable FM synthesizer

A Youtube audio player for your terminal

live coding in python + supercollider

A python wrapper for REAPER

GNU Radio – the Free and Open Software Radio Ecosystem

Desktop music recognition application for windows

Welcome to Nexus. Your personal virtual assistant

Automatically move or copy files based on metadata associated with the files. For example, file your photos based on EXIF metadata or use MP3 tags to file your music files.

GNOME powered sound conversion

MUSIC-AVQA, CVPR2022 (ORAL)