Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.

Last update: Dec 30, 2022

Related tags

Deep Learning mint

Overview

AI Choreographer: Music Conditioned 3D Dance Generation with AIST++ [ICCV-2021].

Overview

This package contains the model implementation and training infrastructure of our AI Choreographer.

Get started

Pull the code

git clone https://github.com/liruilong940607/mint --recursive

Note here --recursive is important as it will automatically clone the submodule (orbit) as well.

Install dependencies

conda create -n mint python=3.7
conda activate mint
conda install protobuf numpy
pip install tensorflow absl-py tensorflow-datasets librosa

sudo apt-get install libopenexr-dev
pip install --upgrade OpenEXR
pip install tensorflow-graphics tensorflow-graphics-gpu

git clone https://github.com/arogozhnikov/einops /tmp/einops
cd /tmp/einops/ && pip install . -U

git clone https://github.com/google/aistplusplus_api /tmp/aistplusplus_api
cd /tmp/aistplusplus_api && pip install -r requirements.txt && pip install . -U

Note if you meet environment conflicts about numpy, you can try with pip install numpy==1.20.

Get the data

See the website

Get the checkpoint

Download from google drive here, and put them to the folder ./checkpoints/

Run the code

complie protocols

protoc ./mint/protos/*.proto

preprocess dataset into tfrecord

python tools/preprocessing.py \
    --anno_dir="/mnt/data/aist_plusplus_final/" \
    --audio_dir="/mnt/data/AIST/music/" \
    --split=train
python tools/preprocessing.py \
    --anno_dir="/mnt/data/aist_plusplus_final/" \
    --audio_dir="/mnt/data/AIST/music/" \
    --split=testval

run training

python trainer.py --config_path ./configs/fact_v5_deeper_t10_cm12.config --model_dir ./checkpoints

Note you might want to change the batch_size in the config file if you meet OUT-OF-MEMORY issue.

run testing and evaluation

# caching the generated motions (seed included) to `./outputs`
python evaluator.py --config_path ./configs/fact_v5_deeper_t10_cm12.config --model_dir ./checkpoints
# calculate FIDs
python tools/calculate_scores.py

Citation

@inproceedings{li2021dance,
  title={AI Choreographer: Music Conditioned 3D Dance Generation with AIST++},
  author={Ruilong Li and Shan Yang and David A. Ross and Angjoo Kanazawa},
  booktitle = {The IEEE International Conference on Computer Vision (ICCV)},
  year = {2021}
}

Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.

Related tags

Overview

AI Choreographer: Music Conditioned 3D Dance Generation with AIST++ [ICCV-2021].

Overview

Get started

Pull the code

Install dependencies

Get the data

Get the checkpoint

Run the code

Citation

Owner

Google Research

Official implementation of the ICCV 2021 paper: "The Power of Points for Modeling Humans in Clothing".

TJU Deep Learning & Neural Network

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021)

Github for the conference paper GLOD-Gaussian Likelihood OOD detector

Posterior temperature optimized Bayesian models for inverse problems in medical imaging

In-place Parallel Super Scalar Samplesort (IPS⁴o)

Source code for our paper "Learning to Break Deep Perceptual Hashing: The Use Case NeuralHash"

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

验证码识别深度学习 tensorflow 神经网络

TensorFlow implementation of "Variational Inference with Normalizing Flows"

Spatial color quantization in Rust

PyTorch implementation of our Adam-NSCL algorithm from our CVPR2021 (oral) paper "Training Networks in Null Space for Continual Learning"

Neural Cellular Automata + CLIP

Restricted Boltzmann Machines in Python.

Python scripts form performing stereo depth estimation using the CoEx model in ONNX.

Send text to girlfriend in the morning

This is an open source library implementing hyperbox-based machine learning algorithms

⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Unofficial Pytorch Implementation of WaveGrad2

Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.

Related tags

Overview

AI Choreographer: Music Conditioned 3D Dance Generation with AIST++ [ICCV-2021].

Overview

Get started

Pull the code

Install dependencies

Get the data

Get the checkpoint

Run the code

Citation

Owner

Google Research

Official implementation of the ICCV 2021 paper: "The Power of Points for Modeling Humans in Clothing".

TJU Deep Learning & Neural Network

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021)

Github for the conference paper GLOD-Gaussian Likelihood OOD detector

Posterior temperature optimized Bayesian models for inverse problems in medical imaging

In-place Parallel Super Scalar Samplesort (IPS⁴o)

Source code for our paper "Learning to Break Deep Perceptual Hashing: The Use Case NeuralHash"

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

验证码识别 深度学习 tensorflow 神经网络

TensorFlow implementation of "Variational Inference with Normalizing Flows"

Spatial color quantization in Rust

PyTorch implementation of our Adam-NSCL algorithm from our CVPR2021 (oral) paper "Training Networks in Null Space for Continual Learning"

Neural Cellular Automata + CLIP

Restricted Boltzmann Machines in Python.

Python scripts form performing stereo depth estimation using the CoEx model in ONNX.

Send text to girlfriend in the morning

This is an open source library implementing hyperbox-based machine learning algorithms

⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Unofficial Pytorch Implementation of WaveGrad2

验证码识别深度学习 tensorflow 神经网络