Code for "3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop"

Last update: Dec 28, 2022

Overview

PyMAF

This repository contains the code for the following paper:

3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop
Hongwen Zhang*, Yating Tian*, Xinchi Zhou, Wanli Ouyang, Yebin Liu, Limin Wang, Zhenan Sun

* Equal contribution

[Project Page] [ArXiv] [Paper]

Requirements

Python 3.6.10

packages

PyTorch (tested on version 1.1.0)
torchvision (tested on version 0.3.0)
Neural Renderer (renders DensePose labels for training)
opendr (visualization during training)
pyrender (optional, for the demo)
other packages listed in requirements.txt
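
As a quick sanity check before running anything, the Python sketch below reports whether the two version-pinned packages are importable and whether their versions match the tested ones. The helper names (installed_version, report) are illustrative and not part of this repository. The import names torch and torchvision are the standard ones for PyTorch and torchvision.

```python
import importlib

# Versions this README reports as tested; keys are import names.
TESTED = {"torch": "1.1.0", "torchvision": "0.3.0"}


def installed_version(module_name):
    """Return the module's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", "unknown")


def report(tested=TESTED):
    """Build one human-readable status line per package."""
    lines = []
    for name, want in tested.items():
        have = installed_version(name)
        if have is None:
            lines.append(f"{name}: not installed (tested with {want})")
        # Drop a local build suffix such as "+cu117" before comparing.
        elif have.split("+")[0] == want:
            lines.append(f"{name}: {have} matches the tested version")
        else:
            lines.append(f"{name}: {have} differs from the tested version {want}")
    return lines


if __name__ == "__main__":
    print("\n".join(report()))
```

A version mismatch does not necessarily mean the code will fail; it only means that combination is not the one listed as tested.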

necessary files

mesh_downsampling.npz & DensePose UV data

Run the following script to fetch mesh_downsampling.npz & DensePose UV data from other repositories.

bash fetch_data.sh

SMPL model files

Collect the SMPL model files from https://smpl.is.tue.mpg.de and UP. Rename them to match the file names in the directory listing below and put them into the ./data/smpl directory.

Fetch preprocessed data from SPIN.

Download the pre-trained model and put it into the ./data/pretrained_model directory.

After collecting the above necessary files, the directory structure of ./data is expected as follows.

./data
├── dataset_extras
│   └── .npz files
├── J_regressor_extra.npy
├── J_regressor_h36m.npy
├── mesh_downsampling.npz
├── pretrained_model
│   └── PyMAF_model_checkpoint.pt
├── smpl
│   ├── SMPL_FEMALE.pkl
│   ├── SMPL_MALE.pkl
│   └── SMPL_NEUTRAL.pkl
├── smpl_mean_params.npz
├── static_fits
│   └── .npy files
└── UV_data
    ├── UV_Processed.mat
    └── UV_symmetry_transforms.mat
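
The layout above can be verified with a short script before running the demo or evaluation. The sketch below is a minimal check, assuming the listing above is complete. The function name missing_entries and the two constants are hypothetical and not part of this repository.

```python
from pathlib import Path

# Files the directory listing names explicitly, relative to ./data.
EXPECTED_FILES = [
    "J_regressor_extra.npy",
    "J_regressor_h36m.npy",
    "mesh_downsampling.npz",
    "pretrained_model/PyMAF_model_checkpoint.pt",
    "smpl/SMPL_FEMALE.pkl",
    "smpl/SMPL_MALE.pkl",
    "smpl/SMPL_NEUTRAL.pkl",
    "smpl_mean_params.npz",
    "UV_data/UV_Processed.mat",
    "UV_data/UV_symmetry_transforms.mat",
]

# Directories that should hold at least one file with the given suffix.
EXPECTED_DIRS = {"dataset_extras": ".npz", "static_fits": ".npy"}


def missing_entries(data_root="./data"):
    """Return the expected files and directory contents that are absent."""
    root = Path(data_root)
    missing = [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]
    for rel, suffix in EXPECTED_DIRS.items():
        # glob() yields nothing if the directory is missing or has no match.
        if not any((root / rel).glob(f"*{suffix}")):
            missing.append(f"{rel}/*{suffix}")
    return missing


if __name__ == "__main__":
    problems = missing_entries()
    if problems:
        print("Missing under ./data:")
        for entry in problems:
            print("  " + entry)
    else:
        print("All expected files are in place.")
```

The check only confirms that files exist; it does not validate their contents.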

Demo

[UPDATE] You can first try the demo on Google Colab using the notebook we have prepared, with no need to set up the environment yourself.

Run the demo code.

python3 demo.py --checkpoint=data/pretrained_model/PyMAF_model_checkpoint.pt --vid_file ./flashmob.mp4

Frame by frame reconstruction. Video clipped from here.

Evaluation

Human3.6M / 3DPW

Run the evaluation code. Use --dataset to specify the evaluation dataset.

# Example usage:

# Human3.6M Protocol 2
python3 eval.py --checkpoint=data/pretrained_model/PyMAF_model_checkpoint.pt --dataset=h36m-p2 --log_freq=20

# 3DPW
python3 eval.py --checkpoint=data/pretrained_model/PyMAF_model_checkpoint.pt --dataset=3dpw --log_freq=20
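
To run both evaluations in one go, the two commands above can be driven from a small Python wrapper. This is only a sketch: eval_command and run_all are hypothetical helpers, and the script assumes it is launched from the repository root so that eval.py resolves. The flags are the ones shown in the commands above.

```python
import subprocess

CHECKPOINT = "data/pretrained_model/PyMAF_model_checkpoint.pt"
DATASETS = ["h36m-p2", "3dpw"]  # values passed to --dataset in the examples above


def eval_command(dataset, checkpoint=CHECKPOINT, log_freq=20):
    """Build the eval.py argument list for one dataset."""
    return [
        "python3",
        "eval.py",
        f"--checkpoint={checkpoint}",
        f"--dataset={dataset}",
        f"--log_freq={log_freq}",
    ]


def run_all(datasets=DATASETS, dry_run=True):
    """Print each command; execute it as well when dry_run is False."""
    for dataset in datasets:
        cmd = eval_command(dataset)
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops at the first failing evaluation.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_all(dry_run=True)
```

With dry_run=True the wrapper only prints the commands, which is a cheap way to confirm the arguments before committing to a full evaluation run.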

COCO Keypoint Localization

Download the preprocessed data coco_2014_val.npz. Put it into the ./data/dataset_extras directory.
Run the COCO evaluation code.

python3 eval_coco.py --checkpoint=data/pretrained_model/PyMAF_model_checkpoint.pt

Training

To train the model, first collect the preprocessed files of the training datasets.

The preprocessed labels have the same format as SPIN and can be retrieved from here. Please refer to SPIN for more details about data preprocessing.

PyMAF is trained on Human3.6M in the first stage and then on a mixture of 2D and 3D datasets in the second stage. Example usage:

# training on Human3.6M
python3 train.py --regressor pymaf_net --single_dataset --misc TRAIN.BATCH_SIZE 64
# training on mixed datasets
python3 train.py --regressor pymaf_net --pretrained_checkpoint path/to/checkpoint_file.pt --misc TRAIN.BATCH_SIZE 64

The first command trains on Human3.6M only; the second trains on the mixed datasets, starting from the given pretrained checkpoint. You can monitor training by pointing TensorBoard at the ./logs directory.
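
The two training stages differ only in whether --single_dataset or --pretrained_checkpoint is passed. The Python sketch below makes that relationship explicit by building both commands from one hypothetical helper, train_command, which is not part of this repository. The checkpoint path is the placeholder from the example above and must be replaced with the checkpoint produced by the first stage.

```python
def train_command(batch_size=64, pretrained_checkpoint=None):
    """Build the train.py argument list for stage 1 or stage 2.

    Without a checkpoint, train on Human3.6M only (stage 1).
    With a checkpoint, train on the mixed datasets (stage 2).
    """
    cmd = ["python3", "train.py", "--regressor", "pymaf_net"]
    if pretrained_checkpoint is None:
        cmd.append("--single_dataset")
    else:
        cmd += ["--pretrained_checkpoint", pretrained_checkpoint]
    cmd += ["--misc", "TRAIN.BATCH_SIZE", str(batch_size)]
    return cmd


if __name__ == "__main__":
    stage1 = train_command()
    # Placeholder path taken from the example above; substitute your own.
    stage2 = train_command(pretrained_checkpoint="path/to/checkpoint_file.pt")
    print(" ".join(stage1))
    print(" ".join(stage2))
```

To follow the logs while training runs, TensorBoard can be started with tensorboard --logdir ./logs.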

Citation

If this work is helpful in your research, please cite the following paper.

@article{pymaf2021,
  title={3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop},
  author={Zhang, Hongwen and Tian, Yating and Zhou, Xinchi and Ouyang, Wanli and Liu, Yebin and Wang, Limin and Sun, Zhenan},
  journal={arXiv preprint arXiv:2103.16507},
  year={2021}
}

Acknowledgments

The code is built upon the following projects. Many thanks for their contributions.
