Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Last update: Dec 31, 2022

Overview

ONNX-Mobile-Human-Pose-3D

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model.

Original image for inference: (https://static2.diariovasco.com/www/pre2017/multimedia/noticias/201412/01/media/DF0N5391.jpg)

❗ ⚠️ Known issues

The models works well when the person is looking forward and without occlusions, it will start to fail as soon as the person is occluded.
The model is fast, but the 3D representation is slow due to matplotlib, this will be fixed. The 3d representation can be ommitted for faster inference by setting draw_3dpose to False

Requirements

OpenCV, imread-from-url, scipy, onnx and onnxruntime. Also, pafy and youtube-dl are required for youtube video inference.

Installation

pip install -r requirements.txt
pip install pafy youtube-dl

ONNX model

The original models were converted to different formats (including .onnx) by PINTO0309, download the models from his repository and save them into the models folder.

YOLOv5s: You will also need an object detector to first detect the people in the image. Download the model from the model zoo and save the .onnx version into the models folder.

Original model

The original model was taken from the original repository.

Examples

Image inference:

python imagePoseEstimation.py

Video inference:

python videoPoseEstimation.py

Webcam inference:

python webcamPoseEstimation.py

Inference video Example

References:

Mobile human pose model: https://github.com/SangbumChoi/MobileHumanPose
PINTO0309's model zoo: https://github.com/PINTO0309/PINTO_model_zoo
PINTO0309's model conversion tool: https://github.com/PINTO0309/openvino2tensorflow
3DMPPE_POSENET_RELEASE repository: https://github.com/mks0601/3DMPPE_POSENET_RELEASE
Original YOLOv5 repository: https://github.com/ultralytics/yolov5
Original paper: https://openaccess.thecvf.com/content/CVPR2021W/MAI/html/Choi_MobileHumanPose_Toward_Real-Time_3D_Human_Pose_Estimation_in_Mobile_Devices_CVPRW_2021_paper.html

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Related tags

Overview

ONNX-Mobile-Human-Pose-3D

❗ ⚠️ Known issues

Requirements

Installation

ONNX model

Original model

Examples

Inference video Example

References:

Owner

Ibai Gorordo

TimeSHAP explains Recurrent Neural Network predictions.

Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)

Unofficial Tensorflow-Keras implementation of Fastformer based on paper [Fastformer: Additive Attention Can Be All You Need](https://arxiv.org/abs/2108.09084).

modelvshuman is a Python library to benchmark the gap between human and machine vision

“袋鼯麻麻——智能购物平台”能够精准地定位识别每一个商品

TensorFlow Metal Backend on Apple Silicon Experiments (just for fun)

Official implementation of the paper Do pedestrians pay attention? Eye contact detection for autonomous driving

This repository contains the DendroMap implementation for scalable and interactive exploration of image datasets in machine learning.

EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers

Repository of best practices for deep learning in Julia, inspired by fastai

Official repository of the paper Privacy-friendly Synthetic Data for the Development of Face Morphing Attack Detectors

Pre-trained Deep Learning models and demos (high quality and extremely fast)

Official Pytorch implementation for Deep Contextual Video Compression, NeurIPS 2021

Denoising images with Fourier Ring Correlation loss

Learning to trade under the reinforcement learning framework

Semantic segmentation task for ADE20k & cityscapse dataset, based on several models.

A tiny, friendly, strong baseline code for Person-reID (based on pytorch).

Creating a Linear Program Solver by Implementing the Simplex Method in Python with NumPy

LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference

Speech Enhancement Generative Adversarial Network Based on Asymmetric AutoEncoder