MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Last update: Jan 07, 2023

This repo is the official implementation of "MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation, Wenhao Li, Hong Liu, Hao Tang, Pichao Wang, Luc Van Gool" in PyTorch.

Dependencies

CUDA 11.1
Python 3.6
PyTorch 1.7.1

Dataset setup

Please download the dataset from the Human3.6M website and refer to VideoPose3D to set up the Human3.6M dataset (in the './dataset' directory).

${POSE_ROOT}/
|-- dataset
|   |-- data_3d_h36m.npz
|   |-- data_2d_h36m_cpn_ft_h36m_dbb.npz
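
Before testing or training, it can help to confirm that both files are where the layout above expects them. The following is a minimal sketch, not part of the repository: the helper name `missing_dataset_files` and the constant `EXPECTED_FILES` are hypothetical, and the only assumption is that the two files sit in a `dataset` folder under the project root.

```python
from pathlib import Path

# File names taken from the directory layout above.
EXPECTED_FILES = (
    "data_3d_h36m.npz",
    "data_2d_h36m_cpn_ft_h36m_dbb.npz",
)


def missing_dataset_files(pose_root="."):
    """Return the expected dataset files that are absent from <pose_root>/dataset.

    Hypothetical helper for illustration; it only checks that the files exist
    and does not validate their contents.
    """
    dataset_dir = Path(pose_root) / "dataset"
    return [name for name in EXPECTED_FILES if not (dataset_dir / name).is_file()]


if __name__ == "__main__":
    missing = missing_dataset_files(".")
    if missing:
        print("Missing from ./dataset:", ", ".join(missing))
    else:
        print("Dataset files found.")
```

Run it from the project root (the `${POSE_ROOT}` directory) so that the relative `dataset` path resolves correctly.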

Download pretrained model

The pretrained model can be found on Google Drive; please download it and put it in the './checkpoint' directory.

Test the model

To test the pretrained model on Human3.6M:

python main.py --reload --previous_dir 'checkpoint/pretrained'

Here, we compare MHFormer with recent state-of-the-art methods on the Human3.6M dataset. The evaluation metric is Mean Per Joint Position Error (MPJPE), in mm.

Models	MPJPE (mm)
VideoPose3D	46.8
PoseFormer	44.3
MHFormer	43.0
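
For readers unfamiliar with the metric, MPJPE is commonly defined as the Euclidean distance between each predicted joint and its ground-truth position, averaged over all joints and frames. The snippet below is a minimal NumPy sketch of that definition. It is not the evaluation code from this repository; the function name `mpjpe` and the array layout `(frames, joints, 3)` are assumptions made for illustration.

```python
import numpy as np


def mpjpe(predicted, target):
    """Mean Per Joint Position Error.

    Both inputs are arrays of shape (..., joints, 3) in the same unit
    (e.g. mm). Illustrative sketch only, not the repository's evaluator.
    """
    predicted = np.asarray(predicted, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    if predicted.shape != target.shape:
        raise ValueError("predicted and target must have the same shape")
    # Per-joint Euclidean distance, then the mean over every joint and frame.
    return float(np.mean(np.linalg.norm(predicted - target, axis=-1)))


# Toy example: 2 frames, 17 joints, every joint offset by (3, 4, 0) mm.
ground_truth = np.zeros((2, 17, 3))
prediction = ground_truth + np.array([3.0, 4.0, 0.0])
print(mpjpe(prediction, ground_truth))  # 5.0
```

Because the result is a plain average of distances, it carries the same unit as the input coordinates.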

Train the model

To train on Human3.6M:

python main.py --train

Citation

If you find our work useful in your research, please consider citing:

@article{li2021mhformer,
  title={MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation},
  author={Li, Wenhao and Liu, Hong and Tang, Hao and Wang, Pichao and Van Gool, Luc},
  journal={arXiv preprint},
  year={2021}
}

Acknowledgement

Our code is extended from the following repositories. We thank the authors for releasing their code.

Owner

Vegetabird
