Realtime_Multi-Person_Pose_Estimation

Last update: Jan 05, 2023

Overview

Introduction

Multi Person PoseEstimation By PyTorch

Results

Require

Pytorch

Installation

git submodule init && git submodule update

Demo

Download converted pytorch model.
Compile the C++ postprocessing: cd lib/pafprocess; sh make.sh
python demo/picture_demo.py to run the picture demo.
python demo/web_demo.py to run the web demo.

Evalute

python evaluate/evaluation.py to evaluate the model on coco val2017 dataset.
It should have mAP 0.653 for the rtpose, previous rtpose have mAP 0.577 because we do left and right flip for heatmap and PAF for the evaluation. c

Main Results

model name	mAP	Inference Time
[original rtpose]	0.653	-

Download link: rtpose

Development environment

The code is developed using python 3.6 on Ubuntu 18.04. NVIDIA GPUs are needed. The code is developed and tested using 4 1080ti GPU cards. Other platforms or GPU cards are not fully tested.

Quick start

1. Preparation

1.1 Prepare the dataset

cd training; bash getData.sh to obtain the COCO 2017 images in /data/root/coco/images/, keypoints annotations in /data/root/coco/annotations/, make them look like this:

${DATA_ROOT}
|-- coco
    |-- annotations
        |-- person_keypoints_train2017.json
        |-- person_keypoints_val2017.json
    |-- images
        |-- train2017
            |-- 000000000009.jpg
            |-- 000000000025.jpg
            |-- 000000000030.jpg
            |-- ... 
        |-- val2017
            |-- 000000000139.jpg
            |-- 000000000285.jpg
            |-- 000000000632.jpg
            |-- ...

2. How to train the model

Modify the data directory in train/train_VGG19.py and python train/train_VGG19.py

Related repository

CVPR'17, Realtime Multi-Person Pose Estimation.

Network Architecture

testing architecture
training architecture

Contributions

All contributions are welcomed. If you encounter any issue (including examples of images where it fails) feel free to open an issue.

Citation

Please cite the paper in your publications if it helps your research:

@InProceedings{cao2017realtime,
  title = {Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields},
  author = {Zhe Cao and Tomas Simon and Shih-En Wei and Yaser Sheikh},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2017}
  }

Realtime_Multi-Person_Pose_Estimation

Related tags

Overview

Introduction

Results

Require

Installation

Demo

Evalute

Main Results

Development environment

Quick start

1. Preparation

1.1 Prepare the dataset

2. How to train the model

Related repository

Network Architecture

Contributions

Citation

Owner

tensorboy

TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-Captured Scenarios

Reverse engineer your pytorch vision models, in style

[CVPR 2021] NormalFusion: Real-Time Acquisition of Surface Normals for High-Resolution RGB-D Scanning

Knowledge Management for Humans using Machine Learning & Tags

GitHub repository for the ICLR Computational Geometry & Topology Challenge 2021

City-seeds - A random generator of cultural characteristics intended to spark ideas and help draw threads

STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech

Lightweight Face Image Quality Assessment

A Streamlit demo demonstrating the Deep Dream technique. Adapted from the TensorFlow Deep Dream tutorial.

The code is an implementation of Feedback Convolutional Neural Network for Visual Localization and Segmentation.

Imaginaire - NVIDIA's Deep Imagination Team's PyTorch Library

Haze Removal can remove slight to extreme cases of haze affecting an image

Live training loss plot in Jupyter Notebook for Keras, PyTorch and others

Implementation of the method described in the Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

Pytorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021

Translation-equivariant Image Quantizer for Bi-directional Image-Text Generation

Training DiffWave using variational method from Variational Diffusion Models.

[CVPR 2020] Interpreting the Latent Space of GANs for Semantic Face Editing

Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite

EXplainable Artificial Intelligence (XAI)