Single-Shot Motion Completion with Transformer

Last update: Dec 29, 2022

Related tags

Overview

Single-Shot Motion Completion with Transformer

👉 [Preprint] 👈

Abstract

Motion completion is a challenging and long-discussed problem, which is of great significance in film and game applications. For different motion completion scenarios (in-betweening, in-filling, and blending), most previous methods deal with the completion problems with case-by-case designs. In this work, we propose a simple but effective method to solve multiple motion completion problems under a unified framework and achieves a new state of the art accuracy under multiple evaluation settings. Inspired by the recent great success of attention-based models, we consider the completion as a sequence to sequence prediction problem. Our method consists of two modules - a standard transformer encoder with self-attention that learns long-range dependencies of input motions, and a trainable mixture embedding module that models temporal information and discriminates key-frames. Our method can run in a non-autoregressive manner and predict multiple missing frames within a single forward propagation in real time. We finally show the effectiveness of our method in music-dance applications.

State-of-the-art on Lafan1 dataset

With the help of Transformer, we achieve a new SOTA result on Lafan1 dataset.

Lengths = 30	L2Q	L2P	NPSS
Zero-Vel	1.51	6.60	0.2318
Interp.	0.98	2.32	0.2013
ERD-QV	0.69	1.28	0.1328
Ours	0.61	1.10	0.1222

Some results (blue appearaces represent keyframes):

Dance Infilling on Anidance Dataset

We also evaluate our method on the Anidance dataset:

Infilling on the test set (black skeletons are the keyframes):

(From Left to Right: Ours, Interp. and Ground Truth)

Infilling on random keyframes (keyframes are randomly chosen from the test set with a random order for simulating in-the-wild scenario):

(From Left to Right: Ours, Interp. and Ground Truth)

Dance blending

Our method can also work on complex dance movement completion:

Code

Coming soon

Citation

@misc{duan2021singleshot,
      title={Single-Shot Motion Completion with Transformer}, 
      author={Yinglin Duan and Tianyang Shi and Zhengxia Zou and Yenan Lin and Zhehui Qian and Bohan Zhang and Yi Yuan},
      year={2021},
      eprint={2103.00776},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Single-Shot Motion Completion with Transformer

Related tags

Overview

Single-Shot Motion Completion with Transformer

Abstract

State-of-the-art on Lafan1 dataset

Dance Infilling on Anidance Dataset

Dance blending

Code

Citation

Owner

FuxiCV

Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"

exponential adaptive pooling for PyTorch

An official source code for "Augmentation-Free Self-Supervised Learning on Graphs"

Meta Learning Backpropagation And Improving It (VSML)

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

[CVPR2021] De-rendering the World's Revolutionary Artefacts

Personal implementation of paper "Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval"

Indonesian Car License Plate Character Recognition using Tensorflow, Keras and OpenCV.

PyTorch implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation.

An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi

Aiming at the common training datsets split, spectrum preprocessing, wavelength select and calibration models algorithm involved in the spectral analysis process

A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available.

A lightweight library designed to accelerate the process of training PyTorch models by providing a minimal

Text-to-Image generation

This is the official repository of XVFI (eXtreme Video Frame Interpolation)

Tools for manipulating UVs in the Blender viewport.

Self-supervised Product Quantization for Deep Unsupervised Image Retrieval - ICCV2021

Doosan robotic arm, simulation, control, visualization in Gazebo and ROS2 for Reinforcement Learning.

Residual Pathway Priors for Soft Equivariance Constraints

Code for our paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021