Pose Transformers: Human Motion Prediction with Non-Autoregressive Transformers

Last update: Dec 14, 2022

Related tags

Deep Learning potr

Overview

Pose Transformers: Human Motion Prediction with Non-Autoregressive Transformers

This is the repo used for human motion prediction with non-autoregressive transformers published with our paper

Requirements

Pytorch>=1.7.
Numpy.
Tensorboard for pytorch.

Data

We have performed experiments with 2 different datasets

H36M
NTURGB+D (60 actions)

Follow the instructions to download each dataset and place it in data.

Note. You can download the H36M dataset using wget http://www.cs.stanford.edu/people/ashesh/h3.6m.zip. However, the code expects files to be npy files instead of txt. You can use the script in data/h36_convert_txt_to_numpy.py to convert to npy files.

Training

To run training with H3.6M dataset and save experiment results in POTR_OUT folder run the following:

python training/transformer_model_fn.py \
  --model_prefix=${POTR_OUT} \
  --batch_size=16 \
  --data_path=${H36M} \
  --learning_rate=0.0001 \
  --max_epochs=500 \
  --steps_per_epoch=200 \
  --loss_fn=l1 \
  --model_dim=128 \
  --num_encoder_layers=4 \
  --num_decoder_layers=4 \
  --num_heads=4 \
  --dim_ffn=2048 \
  --dropout=0.3 \
  --lr_step_size=400 \
  --learning_rate_fn=step \
  --warmup_epochs=100 \
  --pose_format=rotmat \
  --pose_embedding_type=gcn_enc \
  --dataset=h36m_v2 \
  --pre_normalization \
  --pad_decoder_inputs \
  --non_autoregressive \
  --pos_enc_alpha=10 \
  --pos_enc_beta=500 \
  --predict_activity \
  --action=all

Where pose_embedding_type controls the type of architectures of networks to be used for encoding and decoding skeletons (\phi and \psi in our paper). See models/PoseEncoderDecoder.py for the types of architectures. Tensorboard curves and pytorch models will be saved in ${POTR_OUT}.

Citation

If you happen to use the code for your research, please cite the following paper

@inproceedings{Martinez_ICCV_2021,
author = "Mart\'inez-Gonz\'alez, A. and Villamizar, M. and Odobez, J.M.",
title = {Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers},
booktitle = {IEEE/CVF International Conference on Computer Vision - Workshops (ICCV)},
year = {2021}
}

Pose Transformers: Human Motion Prediction with Non-Autoregressive Transformers

Related tags

Overview

Pose Transformers: Human Motion Prediction with Non-Autoregressive Transformers

Requirements

Data

Training

Citation

Owner

Idiap Research Institute

A small demonstration of using WebDataset with ImageNet and PyTorch Lightning

Unofficial implementation of One-Shot Free-View Neural Talking Head Synthesis

An Unsupervised Detection Framework for Chinese Jargons in the Darknet

ONNX Runtime Web demo is an interactive demo portal showing real use cases running ONNX Runtime Web in VueJS.

This is a repo of basic Machine Learning!

Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)

[CVPRW 2021] Code for Region-Adaptive Deformable Network for Image Quality Assessment

RCD: Relation Map Driven Cognitive Diagnosis for Intelligent Education Systems

I explore rock vs. mine prediction using a SONAR dataset

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

This is an implementation of PIFuhd based on Pytorch

Code for Transformer Hawkes Process, ICML 2020.

Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

BrainGNN - A deep learning model for data-driven discovery of functional connectivity

🚀 An end-to-end ML applications using PyTorch, W&B, FastAPI, Docker, Streamlit and Heroku

【ACMMM 2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning

DLL: Direct Lidar Localization

Tutorial in Python targeted at Epidemiologists. Will discuss the basics of analysis in Python 3

This is the 3D Implementation of 《Inconsistency-aware Uncertainty Estimation for Semi-supervised Medical Image Segmentation》