PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Last update: Dec 26, 2022

Dancing to Music

Paper

Hsin-Ying Lee, Xiaodong Yang, Ming-Yu Liu, Ting-Chun Wang, Yu-Ding Lu, Ming-Hsuan Yang, Jan Kautz
Dancing to Music. Neural Information Processing Systems (NeurIPS), 2019.
[Paper] [YouTube] [Project] [Blog] [Supp]

Example Videos

Beat-Matching
1st row: generated dance sequences, 2nd row: music beats, 3rd row: kinematics beats

Multimodality
Generate various dance sequences with the same music and the same initial pose.

Long-Term Generation
Seamlessly generate a dance sequence of arbitrary length.

Photo-Realistic Videos
Map generated dance sequences to photo-realistic videos.

Train Decomposition

python train_decomp.py --name Decomp

Train Composition

python train_comp.py --name Decomp --decomp_snapshot DECOMP_SNAPSHOT

Demo

python demo.py --decomp_snapshot DECOMP_SNAPSHOT --comp_snapshot COMP_SNAPSHOT --aud_path AUD_PATH --out_file OUT_FILE --out_dir OUT_DIR --thr THR

Flags
- aud_path: input .wav file
- out_file: location of output .mp4 file
- out_dir: directory of output frames
- thr: threshold based on motion magnitude
- modulate: whether to do beat warping

Example

python demo.py --decomp_snapshot snapshot/Stage1.ckpt --comp_snapshot snapshot/Stage2.ckpt --aud_path demo/demo.wav --out_file demo/out.mp4 --out_dir demo/out_frame
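
If you want to call the demo from another script, the command can be assembled programmatically. The following is a minimal sketch, not part of this repository: the helper name build_demo_command is hypothetical, and the extension checks are an assumption based only on the flag descriptions above (.wav input, .mp4 output). The modulate flag is left out because its exact syntax is not documented here.

```python
import shlex
from pathlib import Path


def build_demo_command(decomp_snapshot, comp_snapshot, aud_path,
                       out_file, out_dir, thr=None):
    """Build the argument list for demo.py (hypothetical helper).

    Only flags listed in this README are used. The extension checks
    mirror the flag descriptions and are an assumption, not something
    demo.py is known to enforce.
    """
    if Path(aud_path).suffix.lower() != ".wav":
        raise ValueError(f"aud_path should be a .wav file, got: {aud_path}")
    if Path(out_file).suffix.lower() != ".mp4":
        raise ValueError(f"out_file should be an .mp4 file, got: {out_file}")

    cmd = [
        "python", "demo.py",
        "--decomp_snapshot", str(decomp_snapshot),
        "--comp_snapshot", str(comp_snapshot),
        "--aud_path", str(aud_path),
        "--out_file", str(out_file),
        "--out_dir", str(out_dir),
    ]
    # --thr is optional; pass it only when a value was supplied.
    if thr is not None:
        cmd += ["--thr", str(thr)]
    return cmd


if __name__ == "__main__":
    cmd = build_demo_command(
        "snapshot/Stage1.ckpt",
        "snapshot/Stage2.ckpt",
        "demo/demo.wav",
        "demo/out.mp4",
        "demo/out_frame",
    )
    # Print the shell-quoted command for inspection.
    print(shlex.join(cmd))
    # To run it from the repository root, one could use:
    #   import subprocess; subprocess.run(cmd, check=True)
```

Returning a list rather than a single string keeps paths with spaces intact when the command is later passed to subprocess.run.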

Citation

If you find this code useful for your research, please cite our paper:

@inproceedings{lee2019dancing2music,
  title={Dancing to Music},
  author={Lee, Hsin-Ying and Yang, Xiaodong and Liu, Ming-Yu and Wang, Ting-Chun and Lu, Yu-Ding and Yang, Ming-Hsuan and Kautz, Jan},
  booktitle={NeurIPS},
  year={2019}
}

License

Copyright (C) 2020 NVIDIA Corporation. All rights reserved. This work is made available under NVIDIA Source Code License (1-Way Commercial). To view a copy of this license, visit https://nvlabs.github.io/Dancing2Music/LICENSE.txt.

Owner

NVIDIA Research Projects
