Skip to content

milesigel/Audio-Sentiment-Transfer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 

Repository files navigation

Music-Sentiment-Transfer

University of Rochester 2021 Summer REU focusing on music sentiment transfer using CycleGAN

Poster: Music Sentiment Transfer poster.pdf

Paper: Music Sentiment Transfer.pdf

Slides: Music Sentiment Transfer.pptx

For this project, we based our network on the CycleGAN framework for symbolic music created by Brunner et al. in thier paper Symbolic Music Genre Transfer with CycleGAN.

Screen Shot 2021-07-28 at 11 04 29 AM

The network we used was a Pytorch implementation written in 2021. After getting the code working, the results indicated that the GAN network was not able to generate the correct format neccessary for MIDI interpreters. On the other hand, the data processing pipeline we created, midi_to_npy.py was a valid mechansim for creating datasets that were fit for the network. The data format created by the file turns MIDI files into binary piano rolls such that the format is Time x MIDI notes. As for the results of the network, the npy files not run through the network were able to be successfuly convert back into midi files using utils from the Pytorch implementation we refrenced above.

Other tools we created include the wav_splitter folder which is able to split wav files, possibly of long musical performences, into smaller segments. Additionally, the functions also include mechansims for transforming wav files into spectrograms using torchaudio and librosa.

About

University of Rochester 2021 Summer REU focusing on music sentiment transfer using CycleGAN

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages