A package for music online and offline rhythmic information analysis including music Beat, downbeat, tempo and meter tracking.

Last update: Dec 27, 2022

Related tags

Deep Learning BeatNet

Overview

BeatNet

A package for music online and offline rhythmic information analysis including music Beat, downbeat, tempo and meter tracking.

This repository contains the user package and the source code of the Monte Carlo particle flitering inference model of the "BeatNet" music online joint beat/downbeat/tempo/meter tracking system. The arxiv version of the original ISMIR-2021 paper:

arXiv 2108.03576

In addition to the proposed online inference, we added madmom's DBN beat/downbeat inference model for the offline usages. Note that, the offline model still utilize BeatNet's neural network rather than that of Madmom which leads to better performance and significantly faster results.

Note: All models are trained using pytorch and are included in the models folder. In order to recieve the training script and the datasets data/feature handlers, shoot me an email at mheydari [at] ur.rochester.edu

System Input:

Raw audio waveform

System Output:

A vector including beats and downbeats columns, respectively with the following shape: numpy_array(num_beats, 2).

Installation command:

Approach #1: Installing binaries from the pypi website:

pip install BeatNet

Approach #2: Installing directly from the Git repository:

pip install git+https://github.com/mjhydri/BeatNet

Usage example:

From BeatNet.BeatNet import BeatNet

estimator = BeatNet(1) 

Output = estimator.process("music file directory", inference_model= 'PF', plot = True)

A brief video tutorial of the system (Overview):

In order to demonstrate the performance of the system for different beat/donbeat tracking difficulties, here are three video demo examples :

1: Song Difficulty: Easy

2: Song difficulty: Medium

3: Song difficulty: Veteran

Acknowledgements:

For the input feature extraction and implementing of the beat state space, Librosa and Madmom libraries are ustilzed. Many thanks for their great jobs. This work has been partially supported by the National Science Foundation grants 1846184 and DGE-1922591.

References:

M. Heydari, F. Cwitkowitz, and Z. Duan, “BeatNet:CRNN and particle filtering for online joint beat down-beat and meter tracking,” in Proc. of the 22th Intl. Conf.on Music Information Retrieval (ISMIR), 2021.

M. Heydari and Z. Duan, “Don’t Look Back: An online beat tracking method using RNN and enhanced particle filtering,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2021.

A package for music online and offline rhythmic information analysis including music Beat, downbeat, tempo and meter tracking.

Related tags

Overview

BeatNet

System Input:

System Output:

Installation command:

Usage example:

A brief video tutorial of the system (Overview):

Acknowledgements:

References:

Owner

Mojtaba Heydari

Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA sets the new standard on reconstructing highly emotional images in-the-wild

Rethinking Nearest Neighbors for Visual Classification

ConE: Cone Embeddings for Multi-Hop Reasoning over Knowledge Graphs

RAANet: Range-Aware Attention Network for LiDAR-based 3D Object Detection with Auxiliary Density Level Estimation

A lightweight face-recognition toolbox and pipeline based on tensorflow-lite

Official implementation of Meta-StyleSpeech and StyleSpeech

A 1.3B text-to-image generation model trained on 14 million image-text pairs

Implement of homography net by pytorch

DeepStruc is a Conditional Variational Autoencoder which can predict the mono-metallic nanoparticle from a Pair Distribution Function.

Anomaly detection in multi-agent trajectories: Code for training, evaluation and the OpenAI highway simulation.

Faster RCNN pytorch windows

Python implementation of Bayesian optimization over permutation spaces.

An updated version of virtual model making

Optimized code based on M2 for faster image captioning training

This repo contains the implementation of the algorithm proposed in Off-Belief Learning, ICML 2021.

Contour-guided image completion with perceptual grouping (BMVC 2021 publication)

It's a powerful version of linebot

So-ViT: Mind Visual Tokens for Vision Transformer

Pacman-AI - AI project designed by UC Berkeley. Designed reflex and minimax agents for the game Pacman.

Motion Reconstruction Code and Data for Skills from Videos (SFV)