Pytorch implementation of the popular Improv RNN model originally proposed by the Magenta team.

Last update: Nov 11, 2022

Overview

Pytorch Implementation of Improv RNN

Overview

This code is a pytorch implementation of the popular Improv RNN model originally implemented by the Magenta team. The model is able to generate melodies conditioned on a given chord progression.
The specific model implemented in this repository is the Chord Pitches Improv model which encodes chords as the concatenation of the following length-12 vectors:

a one-hot encoding of the chord root pitch class, e.g. [0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0] for a D major (or minor, etc.) chord
a binary vector indicating presence or absence of each pitch class, e.g. [1, 0, 0, 1, 1, 0, 0, 1, 0, 0, 1, 0] for a C7#9 chord
a one-hot encoding of the chord bass pitch class, which is usually the same as the chord root pitch class except in the case of "slash chords" like C/E

You can either use a pre-trained checkpoint of the model or train your own using the steps below.

Installation

Install Required Libraries

pip install -r requirements.txt

Generate a Melody Given Backing Chords

A pretrained checkpoint of the model can be found in the "checkpoints" folder. The checkpoint has been trained for 1000 epochs on the OpenEWLD dataset.

python 003_generate_melody.py --backing_chords "C G Am F C G F C" --output out.mid

This will generate a melody starting with a middle C over the chord progression C G Am F C G F C, where each chord lasts one bar. You can modify the backing chords as you like using the backing_chords parameter. You can define where the generated midi file should be saved with the output parameter.

An example of the generated RNN features is visualized here:

Train Your Own Model

Download OpenEWLD Dataset

To train the model, the OpenEWLD dataset is used. OpenEWLD is a subset of the Wikifonia Leadsheet Dataset reduced to only copyright free songs. A lead sheet is a musical score that contains a notation of the melody and the underlying chord progression of a song.
The song examples are in the compressed musicxml (*.MXL) format which can be parsed in to sequences of note events using the note-seq library.

Dataset Preparation

Extract features from musicxml files and store them in a h5 file.

python 001_create_dataset.py --input C:/Datasets/OpenEWLD/dataset

Training

Track metrics using Tensorboard

python 002_train.py --num_epochs 1000

Track metrics using Tensorboard

tensorboard --logdir ./logs/

The curves of the loss and accuracy over the training epochs are shown in tensorboard:

Pytorch implementation of the popular Improv RNN model originally proposed by the Magenta team.

Related tags

Overview

Pytorch Implementation of Improv RNN

Overview

Installation

Install Required Libraries

Generate a Melody Given Backing Chords

Train Your Own Model

Download OpenEWLD Dataset

Dataset Preparation

Training

Owner

Sebastian Murgul

Implementation of the paper ''Implicit Feature Refinement for Instance Segmentation''.

Finding an Unsupervised Image Segmenter in each of your Deep Generative Models

Pmapper is a super-resolution and deconvolution toolkit for python 3.6+

TorchX is a library containing standard DSLs for authoring and running PyTorch related components for an E2E production ML pipeline.

Code to go with the paper "Decentralized Bayesian Learning with Metropolis-Adjusted Hamiltonian Monte Carlo"

Offline Reinforcement Learning with Implicit Q-Learning

A PyTorch implementation of deep-learning-based registration

An Approach to Explore Logistic Regression Models

This repository is for our paper Exploiting Scene Graphs for Human-Object Interaction Detection accepted by ICCV 2021.

UDP++ (ECCVW 2020 Oral), (Winner of COCO 2020 Keypoint Challenge).

Code for all the Advent of Code'21 challenges mostly written in python

Implementation of Diverse Semantic Image Synthesis via Probability Distribution Modeling

A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for ONNX.

Leaf: Multiple-Choice Question Generation

Bilinear attention networks for visual question answering

Official PyTorch implementation of RIO

[CVPR 2022] PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision (Oral)

Demonstrates how to divide a DL model into multiple IR model files (division) and introduce a simplest way to implement a custom layer works with OpenVINO IR models.

EmoTag helps you train emotion detection model for Chinese audios

This repository contains the implementation of the following paper: Cross-Descriptor Visual Localization and Mapping