music-ai

Deep learning transformer model that generates unique music sequences.

Abstract

In 2017, a new state-of-the-art was published for natural language processing: the Transformer. Relying solely on attention mechanisms, the Transformer outperformed existing solutions based on recurrent and convolutional neural networks¹. However, recurrent neural networks, long short-term memory, and gated recurrent neural networks remain dominant in the field of generative music. I aim to introduce the Transformer into the field of music, with the goal of teaching the deep learning model to predict the second half of a composition given the first half. A Transformer equipped with 32 attention heads and sinusoidal positional encoding was trained on the Nottingham MIDI dataset for 5000 epochs over a period of 48 hours, optimized by stochastic gradient descent and measured with cross entropy loss, and regulated by an exponential learning rate decrease schedule. For the first thousand epochs, the model had noticeable improvement but lacked arrangement to the generated sequences. By five thousand epochs, the model clearly demonstrated the knowledge of general music trends used to better predict how classical composers write their pieces, and most tracks were melodic to the human ear. Future applications of this technique include generating tracks for various instruments, rating the quality of existing music tracks, and complete originality if combined with a generative network mapping melodies to latent space.

¹ Attention Is All You Need

Video

Hardware

Ubuntu

32 GB RAM
Intel Core i3-4170 CPU @3.70 GHz x4 (4 GB RAM)
NVIDIA GeForce GTX 1050 Ti

Deep learning transformer model that generates unique music sequences.

Related tags

Overview

music-ai

Abstract

Video

Hardware

Owner

xacer

A collection of free MIDI chords and progressions ready to be used in your DAW, Akai MPC, or Roland MC-707/101

Open Sound Strip, Sequence or Record in Audacity

Voicefixer aims at the restoration of human speech regardless how serious its degraded.

𝙰 𝙼𝚞𝚜𝚒𝚌 𝙱𝚘𝚝 𝙲𝚛𝚎𝚊𝚝𝚎𝚍 𝙱𝚢 𝚃𝚎𝚊𝚖𝙳𝚕𝚝 💖

🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.

A tool for retrieving audio in the past

Generating a structured library of .wav samples with Python.

Port Hitsuboku Kumi Chinese CVVC voicebank to deepvocal. / 筆墨クミDeepvocal中文音源

Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.

NovaMusic is a music sharing robot. Users can get music and music lyrics using inline queries.

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

This is a short program that takes the input from your microphone and uses OpenGL to draw a live colourful pattern

A python library for working with praat, textgrids, time aligned audio transcripts, and audio files.

Python CD-DA ripper preferring accuracy over speed

A Music Player Bot for Discord Servers

Basically Play Pauses the song when it is safe to do so. when you die in a round

python wrapper for rubberband

Reading list for research topics in sound event detection

This is a python package that turns any images into MIDI files that views the same as them

GNOME powered sound conversion