Code of the lileonardo team for the 2021 Emotion and Theme Recognition in Music task of MediaEval 2021

Last update: Aug 02, 2022

Related tags

Overview

Emotion and Theme Recognition in Music

The repository contains code for the submission of the lileonardo team to the 2021 Emotion and Theme Recognition in Music task of MediaEval 2021 (results).

Requirements

python >= 3.7
pip install -r requirements.txt in a virtual environment
Download data from the MTG-Jamendo Dataset in data/jamendo. Audio files go to data/jamendo/mp3 and melspecs to data/jamendo/melspecs.
Process 128 bands mel spectrograms and store them in data/jamendo/melspecs2 by running:
```
python preprocess.py experiments/preprocessing/melspecs2.json
```

Usage

Run python main.py experiments/DIR where DIR contains the parameters.

Parameters are overridable by command line arguments:

python main.py --help

usage: main.py [-h] [--data_dir DATA] [--num_workers NUM] [--restart_training] [--restore_name NAME]
               [--num_epochs EPOCHS] [--learning_rate LR] [--weight_decay WD] [--dropout DROPOUT]
               [--batch_size BS] [--manual_seed SEED] [--model MODEL] [--loss LOSS]
               [--calculate_stats]
               DIRECTORY

Train according to parameters in DIRECTORY

positional arguments:
  DIRECTORY            path of the directory containing parameters

optional arguments:
  -h, --help           show this help message and exit
  --data_dir DATA      path of the directory containing data (default: data)
  --num_workers NUM    number of workers for dataloader (default: 4)
  --restart_training   overwrite previous training (default is to resume previous training)
  --restore_name NAME  name of checkpoint to restore (default: last)
  --num_epochs EPOCHS  override number of epochs in parameters
  --learning_rate LR   override learning rate
  --weight_decay WD    override weight decay
  --dropout DROPOUT    override dropout
  --batch_size BS      override batch size
  --manual_seed SEED   override manual seed
  --model MODEL        override model
  --loss LOSS          override loss
  --calculate_stats    recalculate mean and std of data (default is to calculate only when they
                       don't exist in parameters)

Ensemble predictions

The predictions are averaged by running:

python average.py --outputs experiments/convs-m96*/predictions/test-last-swa-outputs.npy --targets experiments/convs-m96*/predictions/test-last-swa-targets.npy --preds_path predictions/convs.npy

python average.py --outputs experiments/filters-m128*/predictions/test-last-swa-outputs.npy --targets experiments/filters-m128*/predictions/test-last-swa-targets.npy --preds_path predictions/filters.npy

python average.py --outputs predictions/convs.npy predictions/filters.npy --targets predictions/targets.npy

Code of the lileonardo team for the 2021 Emotion and Theme Recognition in Music task of MediaEval 2021

Related tags

Overview

Emotion and Theme Recognition in Music

Requirements

Usage

Ensemble predictions

Owner

Vincent Bour

Implementation of CVPR'21: RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction

List of papers, code and experiments using deep learning for time series forecasting

The devkit of the nuPlan dataset.

KoCLIP: Korean port of OpenAI CLIP, in Flax

🐦 Quickly annotate data from the comfort of your Jupyter notebook

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021 Accepted

Official implementation of the ICML2021 paper "Elastic Graph Neural Networks"

Domain Generalization with MixStyle, ICLR'21.

Multi-Scale Progressive Fusion Network for Single Image Deraining

This is the repository for CVPR2021 Dynamic Metric Learning: Towards a Scalable Metric Space to Accommodate Multiple Semantic Scales

TensorFlow-based neural network library

MWPToolkit is a PyTorch-based toolkit for Math Word Problem (MWP) solving.

Angular & Electron desktop UI framework. Angular components for native looking and behaving macOS desktop UI (Electron/Web)

Understanding the Properties of Minimum Bayes Risk Decoding in Neural Machine Translation.

Tensorflow implementation of MIRNet for Low-light image enhancement

Create UIs for prototyping your machine learning model in 3 minutes

Official implementation of "Articulation Aware Canonical Surface Mapping"

Plug and play transformer you can find network structure and official complete code by clicking List

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

This repo is the code release of EMNLP 2021 conference paper "Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories".