University of Rochester 2021 Summer REU focusing on music sentiment transfer using CycleGAN

Last update: Jan 24, 2022

Related tags

Deep Learning Audio-Sentiment-Transfer

Overview

Music-Sentiment-Transfer

Poster: Music Sentiment Transfer poster.pdf

Paper: Music Sentiment Transfer.pdf

Slides: Music Sentiment Transfer.pptx

For this project, we based our network on the CycleGAN framework for symbolic music created by Brunner et al. in their paper Symbolic Music Genre Transfer with CycleGAN.

The network we used was a PyTorch implementation written in 2021. After we got the code working, the results indicated that the GAN was not able to generate output in the format necessary for MIDI interpreters. On the other hand, the data processing pipeline we created, midi_to_npy.py, was a valid mechanism for creating datasets fit for the network. It turns MIDI files into binary piano rolls in the format Time x MIDI notes. The npy files that were not run through the network could be successfully converted back into MIDI files using utils from the PyTorch implementation referenced above.
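To illustrate the Time x MIDI notes layout, here is a minimal sketch of a binary piano roll built with NumPy. It is not the code from midi_to_npy.py: the function names, the `(pitch, start_step, end_step)` note tuples, and the choice of 128 pitches are assumptions made for this example, and the MIDI parsing step is left out entirely.

```python
import numpy as np

N_PITCHES = 128  # assumed: one column per MIDI note number 0..127


def notes_to_piano_roll(notes, n_steps):
    """Build a binary piano roll of shape (n_steps, N_PITCHES).

    notes: iterable of (pitch, start_step, end_step), end exclusive.
    This tuple format is hypothetical, chosen only for the example.
    """
    roll = np.zeros((n_steps, N_PITCHES), dtype=bool)
    for pitch, start, end in notes:
        roll[start:end, pitch] = True
    return roll


def piano_roll_to_notes(roll):
    """Recover (pitch, start_step, end_step) tuples from a binary roll."""
    notes = []
    for pitch in range(roll.shape[1]):
        column = roll[:, pitch].astype(np.int8)
        # Pad with zeros so notes touching either edge still yield
        # an onset (+1) and an offset (-1) in the difference.
        changes = np.diff(np.concatenate(([0], column, [0])))
        starts = np.flatnonzero(changes == 1)
        ends = np.flatnonzero(changes == -1)
        notes.extend((pitch, int(s), int(e)) for s, e in zip(starts, ends))
    return sorted(notes, key=lambda n: (n[1], n[0]))


# A C major arpeggio with overlapping notes, on an 8-step grid.
example_notes = [(60, 0, 4), (64, 2, 6), (67, 4, 8)]
roll = notes_to_piano_roll(example_notes, n_steps=8)
np.save("example_roll.npy", roll)  # stored as a Time x MIDI notes array
```

Because the roll is binary, velocity is lost, and two back-to-back notes on the same pitch merge into one longer note when converted back.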

Other tools we created include the wav_splitter folder, which can split wav files, possibly of long musical performances, into smaller segments. Its functions also include mechanisms for transforming wav files into spectrograms using torchaudio and librosa.
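The splitting step can be sketched with Python's standard `wave` module alone. This is not the code in the wav_splitter folder: `split_wav`, its parameters, and the output naming scheme are hypothetical, and the sketch assumes uncompressed PCM wav input.

```python
import wave


def split_wav(src_path, segment_seconds, out_prefix):
    """Split a PCM wav file into consecutive fixed-length segments.

    Returns the list of written paths. The last segment may be shorter.
    The naming scheme (<out_prefix>_0000.wav, ...) is an assumption.
    """
    out_paths = []
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        frames_per_segment = round(segment_seconds * src.getframerate())
        if frames_per_segment <= 0:
            raise ValueError("segment_seconds is too short for this sample rate")
        index = 0
        while True:
            frames = src.readframes(frames_per_segment)
            if not frames:
                break
            out_path = f"{out_prefix}_{index:04d}.wav"
            with wave.open(out_path, "wb") as dst:
                # Copy channels, sample width, and rate from the source;
                # the frame count in the header is corrected on close.
                dst.setparams(params)
                dst.writeframes(frames)
            out_paths.append(out_path)
            index += 1
    return out_paths
```

For example, `split_wav("performance.wav", 10.0, "segments/performance")` would write 10-second pieces of a hypothetical performance.wav, provided the segments directory already exists.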

Owner

Miles Sigel
