PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Last update: Dec 26, 2022

Related tags

Deep Learning Dancing2Music

Overview

Dancing to Music

PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Paper

Hsin-Ying Lee, Xiaodong Yang, Ming-Yu Liu, Ting-Chun Wang, Yu-Ding Lu, Ming-Hsuan Yang, Jan Kautz
Dancing to Music Neural Information Processing Systems (NeurIPS) 2019
[Paper] [YouTube] [Project] [Blog] [Supp]

Example Videos

Beat-Matching
1st row: generated dance sequences, 2nd row: music beats, 3rd row: kinematics beats

Multimodality
Generate various dance sequences with the same music and the same initial pose.

Long-Term Generation
Seamlessly generate a dance sequence with arbitrary length.

Photo-Realisitc Videos
Map generated dance sequences to photo-realistic videos.

Train Decomposition

python train_decomp.py --name Decomp

Train Composition

python train_comp.py --name Decomp --decomp_snapshot DECOMP_SNAPSHOT

Demo

python demo.py --decomp_snapshot DECOMP_SNAPSHOT --comp_snapshot COMP_SNAPSHOT --aud_path AUD_PATH --out_file OUT_FILE --out_dir OUT_DIR --thr THR

Flags
- aud_path: input .wav file
- out_file: location of output .mp4 file
- out_dir: directory of output frames
- thr: threshold based on motion magnitude
- modulate: whether to do beat warping
Example

python demo.py -decomp_snapshot snapshot/Stage1.ckpt --comp_snapshot snapshot/Stage2.ckpt --aud_path demo/demo.wav --out_file demo/out.mp4 --out_dir demo/out_frame

Citation

If you find this code useful for your research, please cite our paper:

@inproceedings{lee2019dancing2music,
  title={Dancing to Music},
  author={Lee, Hsin-Ying and Yang, Xiaodong and Liu, Ming-Yu and Wang, Ting-Chun and Lu, Yu-Ding and Yang, Ming-Hsuan and Kautz, Jan},
  booktitle={NeurIPS},
  year={2019}
}

License

Copyright (C) 2020 NVIDIA Corporation. All rights reserved. This work is made available under NVIDIA Source Code License (1-Way Commercial). To view a copy of this license, visit https://nvlabs.github.io/Dancing2Music/LICENSE.txt.

PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Related tags

Overview

Dancing to Music

Paper

Example Videos

Train Decomposition

Train Composition

Demo

Citation

License

Owner

NVIDIA Research Projects

PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in clustering (CVPR2021)

Source code for paper "Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling", AAAI 2021

Elastic weight consolidation technique for incremental learning.

PyTorch Implementation for AAAI'21 "Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection"

Data-Driven Operational Space Control for Adaptive and Robust Robot Manipulation

A Traffic Sign Recognition Project which can help the driver recognise the signs via text as well as audio. Can be used at Night also.

Fake videos detection by tracing the source using video hashing retrieval.

[WWW 2021] Source code for "Graph Contrastive Learning with Adaptive Augmentation"

BTC-Generator - BTC Generator With Python

Bridging Vision and Language Model

paper: Hyperspectral Remote Sensing Image Classification Using Deep Convolutional Capsule Network

Barlow Twins and HSIC

Machine learning Bot detection technique, based on United States election dataset

STEM: An approach to Multi-source Domain Adaptation with Guarantees

Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation"

Neural Dynamic Policies for End-to-End Sensorimotor Learning

Project to create an open-source 6 DoF input device

MMdet2-based reposity about lightweight detection model: Nanodet, PicoDet.

Benchmark for Answering Existential First Order Queries with Single Free Variable

A trashy useless Latin programming language written in python.