MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Last update: Jan 07, 2023

Related tags

Overview

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

This repo is the official implementation of "MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation, Wenhao Li, Hong Liu, Hao Tang, Pichao Wang, Luc Van Gool" in PyTorch.

Dependencies

Cuda 11.1
Python 3.6
Pytorch 1.7.1

Dataset setup

Please download the dataset from Human3.6m website and refer to VideoPose3D to set up the Human3.6M dataset ('./dataset' directory).

${POSE_ROOT}/
|-- dataset
|   |-- data_3d_h36m.npz
|   |-- data_2d_h36m_cpn_ft_h36m_dbb.npz

Download pretrained model

The pretrained model can be found in Google_Drive, please download it and put in the './checkpoint' dictory.

Test the model

To test on pretrained model on Human3.6M:

python main.py --reload --previous_dir 'checkpoint/pretrained'

Here, we compare our MHFormer with recent state-of-the-art methods on Human3.6M dataset. Evaluation metric is Mean Per Joint Position Error (MPJPE) in mm.

Models	MPJPE
VideoPose3D	46.8
PoseFormer	44.3
MHFormer	43.0

Train the model

To train on Human3.6M:

python main.py --train

Citation

If you find our work useful in your research, please consider citing:

@article{li2021mhformer,
  title={MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation},
  author={Li, Wenhao and Liu, Hong and Tang, Hao and Wang, Pichao and Van Gool, Luc},
  journal={arXiv preprint},
  year={2021}
}

Acknowledgement

Our code is extended from the following repositories. We thank the authors for releasing the codes.

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Related tags

Overview

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Dependencies

Dataset setup

Download pretrained model

Test the model

Train the model

Citation

Acknowledgement

Owner

Vegetabird

Pytorch implement of 'Unmixing based PAN guided fusion network for hyperspectral imagery'

Voice of Pajlada with model and weights.

We envision models that are pre-trained on a vast range of domain-relevant tasks to become key for molecule property prediction

Pytorch port of Google Research's LEAF Audio paper

Ensemble Visual-Inertial Odometry (EnVIO)

Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training

An official source code for "Augmentation-Free Self-Supervised Learning on Graphs"

PyTorch version of the paper 'Enhanced Deep Residual Networks for Single Image Super-Resolution' (CVPRW 2017)

Codes to pre-train T5 (Text-to-Text Transfer Transformer) models pre-trained on Japanese web texts

Python project to take sound as input and output as RGB + Brightness values suitable for DMX

TensorFlow Implementation of Unsupervised Cross-Domain Image Generation

gACSON software for visualization, processing and analysis of three-dimensional electron microscopy images

Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.

An end-to-end image translation model with weight-map for color constancy

This code is for our paper "VTGAN: Semi-supervised Retinal Image Synthesis and Disease Prediction using Vision Transformers"

Code for our paper A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization,

PyTorch code for the NAACL 2021 paper "Improving Generation and Evaluation of Visual Stories via Semantic Consistency"

Deep Two-View Structure-from-Motion Revisited

Crab is a ﬂexible, fast recommender engine for Python that integrates classic information ﬁltering recommendation algorithms in the world of scientiﬁc Python packages (numpy, scipy, matplotlib).

Semantic Edge Detection with Diverse Deep Supervision