In this project we can see how we can generate automatic music using character RNN.

Overview

Automatic Music Genaration

Table of Contents

  1. Project Description
  2. Approach towards the problem
  3. Limitations
  4. Libraries Used
  5. Summary
  6. Applications
  7. References


1. Project Description

  • Natural Language Processing in Artificial Intelligence is the application of computational techniques to the analysis and synthesis of natural language and speech.

  • For a hypothetical scenario it was assumed that ABC music prod. pvt.ltd is a reknowned audio-video production house based out of Mumbai, India

  • As COVID-19 cases are increasing day by day it is almost impossible for the musicians to coop up with real time studio work.

  • Hence, the company wants you to make an AI based music genaration system.

  • The goal of this project is to make an AI based music genaration system.

  • The key contraint to the problem is accruracy.

  • You have been hired as a freelance data scientist for ABC music prod. pvt.ltd

  • The model should read a text file in abc format.

  • The model should genarate the corresponding music framed out of that note sequence.


2. Approach towards the problem

  • The approach to this project was to make a chat bot from scratch.

  • Initially all the necessary libraries were imported and installed.

  • Then, the data corpus was formed.

  • A data corpus is a collection of linguistic data.

  • Then while performing preprocessing we used lemitization technique in order to process the data into desirable format.

  • Then after preprocessing, we genarated various batches to fit into the model.

  • After that we build a charecter RNN model and trained the model for 100 epochs with a batch size of 16 and a sequence length of 64

  • For every 10 epochs the model would save the updated weights in the main directory

  • We recieved a validation accuracy of 92%

  • Now we genarated the music code which is nothing but the predicted vocabulary

  • After that we went here and we were successfully able to genarate music.

3. Limitation(s)

  • The only limitation of this model is that it is being trained with very less data.

  • However, while getting trained on various intrument data this model can be further enhanced for different instruments as well.

  • We have trained this model for only 100 epochs. As the number of epochs increase it is expected that the accuracy of the model will increase.

4. Libraries Used

Following are the list of libraries that were used for making this project.

  • Python was used as the general purposae programming language

  • Keras was used to perform all Deep Learning operation(s) such as model bulding, compliation and training.

  • numpy was used in order to calcucate numercal operations.

5. Summary

  • In this project an Automatic music genaration system was made from scratch.

  • Here, we recieved a validation accuracy of 92%

  • This project can be widely used for music production systems

  • The only limitation of this model is that it is being trained with very less data.

  • However, while getting trained on various intrument data this model can be further enhanced for different instruments as well.

  • This model can be used for in house music production systems.

  • This can be widely used to automate manual instruments.

  • This can be also used to make automatic VST(virtual studio toolkit) plugins

6. Applications

  • This model can be used for in house music production systems.

  • This can be widely used to automate manual instruments.

  • This can be also used to make automatic VST(virtual studio toolkit) plugins

7. References

Owner
Pronay Ghosh
A data-driven professional who grows day by day. Currently working as a Data Science Researcher at The International School of AI and Data Science.
Pronay Ghosh
Codes for "Efficient Long-Range Attention Network for Image Super-resolution"

ELAN Codes for "Efficient Long-Range Attention Network for Image Super-resolution", arxiv link. Dependencies & Installation Please refer to the follow

xindong zhang 124 Dec 22, 2022
The project aims to develop a personal-assistant for Windows & Linux-based systems

The project aims to develop a personal-assistant for Windows & Linux-based systems. Samiksha draws its inspiration from virtual assistants like Cortana for Windows, and Siri for iOS. It has been desi

SHUBHANSHU RAI 1 Jan 16, 2022
Tune in is a Collaborative Music Playing Systems where multiple guests can join a room and enjoy the song being played

✨A collaborative music playing systems🎶 where multiple guests can join a room ➡🚪 and enjoy the song🎧 being played.

Vedansh Vijaywargiya 8 Nov 05, 2022
Mousai is a simple application that can identify song like Shazam

Mousai is a simple application that can identify song like Shazam. It saves the artist, album, and title of the identified song in a JSON file.

Dave Patrick 662 Jan 07, 2023
Cobra is a highly-accurate and lightweight voice activity detection (VAD) engine.

On-device voice activity detection (VAD) powered by deep learning.

Picovoice 88 Dec 16, 2022
Open-Source Tools & Data for Music Source Separation: A Pragmatic Guide for the MIR Practitioner

Open-Source Tools & Data for Music Source Separation: A Pragmatic Guide for the MIR Practitioner

IELab@ Korea University 0 Nov 12, 2021
Conferencing Speech Challenge

ConferencingSpeech 2021 challenge This repository contains the datasets list and scripts required for the ConferencingSpeech challenge. For more detai

73 Nov 29, 2022
Voicefixer aims at the restoration of human speech regardless how serious its degraded.

Voicefixer aims at the restoration of human speech regardless how serious its degraded.

Leo 324 Dec 26, 2022
Audio library for modelling loudness

Loudness Loudness is a C++ library with Python bindings for modelling perceived loudness. The library consists of processing modules which can be casc

Dominic Ward 33 Oct 02, 2022
A fast MDCT implementation using SciPy and FFTs

MDCT A fast MDCT implementation using SciPy and FFTs Installation As usual pip install mdct Dependencies NumPy SciPy STFT Usage import mdct spectrum

Nils Werner 43 Sep 02, 2022
Play any song directly into your group voice chat.

Telegram VCPlayer Bot Play any song directly into your group voice chat. Official Bot : VCPlayerBot | Discussion Group : VoiceChat Music Player Suppor

Shubham Kumar 50 Nov 21, 2022
All-In-One Digital Audio Workstation and Plugin Suite

How to install Windows Mac OS X Fedora Ubuntu How to Build Debian and Ubuntu Fedora All Other Linux Distros Mac OS X Windows What is MusiKernel? MusiK

j3ffhubb 111 Sep 21, 2021
SU Music Player — The first open-source PyTgCalls based Pyrogram bot to play music in voice chats

SU Music Player — The first open-source PyTgCalls based Pyrogram bot to play music in voice chats Note Neither this, or PyTgCalls are fully

SU Projects 58 Jan 02, 2023
pyo is a Python module written in C to help digital signal processing script creation.

pyo is a Python module written in C to help digital signal processing script creation.

Olivier Bélanger 1.1k Jan 01, 2023
❤️ This Is The EzilaXMusicPlayer Advaced Repo 🎵

Telegram EzilaXMusicPlayer Bot 🎵 A bot that can play music on telegram group's voice Chat ❤️ Requirements 📝 FFmpeg NodeJS nodesource.com Python 3.7+

Sadew Jayasekara 11 Nov 12, 2022
The official repository for Audio ALBERT

AALBERT Here is also the official repository of AALBERT, which is Pytorch lightning reimplementation of the paper, Audio ALBERT: A Lite Bert for Self-

pohan 55 Dec 11, 2022
Simple, hackable offline speech to text - using the VOSK-API.

Nerd Dictation Offline Speech to Text for Desktop Linux. This is a utility that provides simple access speech to text for using in Linux without being

Campbell Barton 844 Jan 07, 2023
Audio fingerprinting and recognition in Python

dejavu Audio fingerprinting and recognition algorithm implemented in Python, see the explanation here: How it works Dejavu can memorize audio by liste

Will Drevo 6k Jan 06, 2023
Enhanced Audio Player for Discord

Discodo is an enhanced audio player for discord

Mary 42 Oct 05, 2022
A Python 3 script for capturing and recording a SDR stream to a WAV file (or serving it to a HTTP audio stream).

rfsoapyfile A Python 3 script for capturing and recording a SDR stream to a WAV file (or serving it to a HTTP audio stream). The script is threaded fo

4 Dec 19, 2022