InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer

This repository contains the code and examples for our paper InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer. The Video-Music Transformer (VMT) is an attention-based multimodal model that generates piano music for a given video.

Our Dataset

We release a new dataset composed of over 7 hours of piano scores with fine alignment between pop music videos and MIDI files. Our complete InverseMV dataset is available here.

Demo

Here is an example video fragment from our dataset. Note that we do not apply any post-production. Each file is made from the original video combined with a WAVE file converted from the MIDI output of the model.
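The README does not say which tools were used to build the demo files. As a minimal sketch of the step described above (MIDI to WAVE, then WAVE onto the original video), the following assumes the FluidSynth and ffmpeg command-line tools are installed. The function names, the file names, and the soundfont path are hypothetical and not part of this repository.

```python
import subprocess


def midi_to_wav_cmd(midi_path, wav_path, soundfont, sample_rate=44100):
    """Build a FluidSynth command that renders a MIDI file to a WAVE file.

    Assumption: FluidSynth is the synthesizer; the README does not name one.
    """
    return [
        "fluidsynth",
        "-ni",                    # no MIDI input driver, no interactive shell
        "-F", str(wav_path),      # fast-render audio to this file
        "-r", str(sample_rate),   # output sample rate in Hz
        str(soundfont),           # hypothetical piano soundfont (.sf2)
        str(midi_path),
    ]


def mux_cmd(video_path, wav_path, out_path):
    """Build an ffmpeg command that replaces the video's audio with the WAVE file."""
    return [
        "ffmpeg", "-y",
        "-i", str(video_path),
        "-i", str(wav_path),
        "-map", "0:v:0",          # video stream from the first input
        "-map", "1:a:0",          # audio stream from the second input
        "-c:v", "copy",           # keep the original video untouched
        "-c:a", "aac",
        "-shortest",              # stop at the shorter of the two streams
        str(out_path),
    ]


def render_demo(video_path, midi_path, soundfont, out_path, wav_path="tmp.wav"):
    """Run both steps; raises CalledProcessError if either tool fails."""
    subprocess.run(midi_to_wav_cmd(midi_path, wav_path, soundfont), check=True)
    subprocess.run(mux_cmd(video_path, wav_path, out_path), check=True)
```

For example, `render_demo("100-001.mp4", "100-001_vmt.mid", "piano.sf2", "100-001_vmt.mp4")` would produce a file like the VMT demo below, given those (hypothetical) input files. Copying the video stream unchanged matches the statement that no post-production is applied.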

Original

The original music of the videos.

100-001_original.mp4

VMT

The music generated by our VMT model.

100-001_vmt.mp4

Seq2Seq

The music generated by the baseline Seq2Seq model.

100-001_seq2seq.mp4

Citation

Please cite our paper if you use InverseMV in your work:

@article{lin2021inversemv,
  title={InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer},
  author={Lin, Chin-Tung and Yang, Mu},
  journal={arXiv preprint arXiv:2112.15320},
  year={2021}
}
