A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

Last update: May 25, 2022

Related tags

Text Data & NLP midi_language

Overview

MIDI Language

Introduction

Reference

Paper: Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions: code

This is a modified version with an extension of multi-instrumental support.

Function

Convert Midi into event sequence, and represented by mapped integer array.

This could send to NLP models for AI auto music composition.

Due to this project considers more about music structures as well as its chord and melody on higher level, including note, drum, tempo, musical instrument (program in midi) and its expressions (tempo and velocity), rather than digging into too much details like sound source & direction, instrumental performing techniques (such as, bend sound, piano sustain pedal, violin overtones), the language of MIDI is design this way (see chapter Details below).

Usage

See language.py, it contains procedures:

load w2i (word to integer) and i2w (integer to word), for not calculating it every time;
encode midi to iteger array, each object handle one mid file;
decode integer array to midi, each object handle many results and export to mid files;

The code language.py has arguments:

input: input file of audio file to encode/decode;
output: output file of audio file to encode;
train: if have, it will switch to training mode with variations (data augmentation);

MidiEncoder data augmentation:

pitch_variation_range: a random pitch shift within a range for whole midi;
velocity_scale_variation_range: a random note/drum velocity scale for whole midi;
velocity_noise_scale_variation_range: a random note/drum velocity scale for each element within midi;
tempo_scale_variation_range: a random tempo change for whole midi;

MidiDecoder needs numerator and denominator time signatures for reconstructing midi files.

Details

Event Structure

Required:

Bar
Position (0~split-1)

Optional:

note:
- Note
- Program (0~127)
- Pitch (0~127)
- Velocity (0~127)
- Duration (0~split*bar_scale-1)
drum:
- Drum
- Program (0~127)
- Pitch (0~127)
- Velocity (0~127)
- Duration (0~split*bar_scale-1)
chord:
- Chord (chroma_name:chord_name)
tempo:
- Tempo_Class (T0~Ti)
- Tempo_Value (0~59)

A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

Related tags

Overview

MIDI Language

Introduction

Reference

Function

Usage

Details

Event Structure

Required:

Optional:

Owner

Robert Bogan Kang

Shellcode antivirus evasion framework

GraphNLI: A Graph-based Natural Language Inference Model for Polarity Prediction in Online Debates

Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies

leaking paid token generator that was a shit lmao for 100$ haha

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

NLP, before and after spaCy

Translate U is capable of translating the text present in an image from one language to the other.

Labelling platform for text using distant supervision

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Easy, fast, effective, and automatic g-code compression!

Code for our paper "Transfer Learning for Sequence Generation: from Single-source to Multi-source" in ACL 2021.

Biterm Topic Model (BTM): modeling topics in short texts

lightweight, fast and robust columnar dataframe for data analytics with online update

T‘rex Park is a Youzan sponsored project. Offering Chinese NLP and image models pretrained from E-commerce datasets

Codename generator using WordNet parts of speech database

A full spaCy pipeline and models for scientific/biomedical documents.

A Word Level Transformer layer based on PyTorch and 🤗 Transformers.

硕士期间自学的NLP子任务，供学习参考

Google and Stanford University released a new pre-trained model called ELECTRA

Tensorflow Implementation of A Generative Flow for Text-to-Speech via Monotonic Alignment Search