Ultra-lightweight human body posture key point CNN model. ModelSize:2.3MB HUAWEI P40 NCNN benchmark: 6ms/img,

Last update: Dec 27, 2022

Overview

Ultralight-SimplePose

Support NCNN mobile terminal deployment
Based on MXNET(>=1.5.1) GLUON(>=0.7.0) framework
Top-down strategy: The input image is the person ROI detected by the object detector
Lightweight mobile terminal human body posture key point model(COCO 17 person_keypoints)
Detector:https://github.com/dog-qiuqiu/MobileNetv2-YOLOV3

Model

Mobile inference frameworks benchmark (4*ARM_CPU)

Network	Resolution	Inference time (NCNN/Kirin 990)	FLOPS	Weight size	HeatmapAccuracy
Ultralight-Nano-SimplePose	W:192 H:256	~5.4ms	0.224BFlops	2.3MB	74.3%

COCO2017 val keypoints metrics evaluate

 Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets= 20 ] = 0.518
 Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets= 20 ] = 0.816
 Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets= 20 ] = 0.558
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets= 20 ] = 0.498
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets= 20 ] = 0.549
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 20 ] = 0.563
 Average Recall     (AR) @[ IoU=0.50      | area=   all | maxDets= 20 ] = 0.837
 Average Recall     (AR) @[ IoU=0.75      | area=   all | maxDets= 20 ] = 0.607
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets= 20 ] = 0.535
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets= 20 ] = 0.604

Install

pip install mxnet-cu101 gluoncv
pip install opencv-python cython pycocotools

Install mxnet according to your own cuda version

Demo

Test picture

python img_demo.py

Test camera stream

python cam_demo

How To Train

Download the coco2017 dataset

http://images.cocodataset.org/zips/train2017.zip
http://images.cocodataset.org/annotations/annotations_trainval2017.zip
http://images.cocodataset.org/zips/val2017.zip
Unzip the downloaded dataset zip file to the coco directory
交流qq群:1062122604

Train

python train_simple_pose.py

Ncnn Deploy

Dependent library: Opencv Ncnn
Read the camera video stream test by default, if you test the picture, please modify the code

Install ncnn

$ git clone https://github.com/Tencent/ncnn.git
$ cd <ncnn-root-dir>
$ mkdir -p build
$ cd build
$ make -j4
$ make install

Run ncnn sample

$ cp -rf ncnn/build/install/include ./Ultralight-SimplePose/ncnnsample/
$ cp -rf ncnn/build/install/lib ./Ultralight-SimplePose/ncnnsample/
$ g++ -o ncnnpose ncnnpose.cpp -I include/ncnn/ lib/libncnn.a `pkg-config --libs --cflags opencv` -fopenmp
$ ./ncnnpose

Ultra-lightweight human body posture key point CNN model. ModelSize:2.3MB HUAWEI P40 NCNN benchmark: 6ms/img,

Related tags

Overview

Ultralight-SimplePose

Model

Mobile inference frameworks benchmark (4*ARM_CPU)

COCO2017 val keypoints metrics evaluate

Install

Demo

Test picture

Test camera stream

How To Train

Download the coco2017 dataset

Train

Ncnn Deploy

Install ncnn

Run ncnn sample

Ncnn Picture test results

Android sample

Thanks

Owner

Spatial Transformer Nets in TensorFlow/ TensorLayer

Image Segmentation using U-Net, U-Net with skip connections and M-Net architectures

You Only Sample (Almost) Once: Linear Cost Self-Attention Via Bernoulli Sampling

Escaping the Gradient Vanishing: Periodic Alternatives of Softmax in Attention Mechanism

A curated list of resources for Image and Video Deblurring

TransMorph: Transformer for Medical Image Registration

(CVPR 2022 - oral) Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

[ ICCV 2021 Oral ] Our method can estimate camera poses and neural radiance fields jointly when the cameras are initialized at random poses in complex scenarios (outside-in scenes, even with less texture or intense noise )

Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High"

Exploit ILP to learn symmetry breaking constraints of ASP programs.

Animate molecular orbital transitions using Psi4 and Blender

Code used for the results in the paper "ClassMix: Segmentation-Based Data Augmentation for Semi-Supervised Learning"

A Python library for common tasks on 3D point clouds

Weighted QMIX: Expanding Monotonic Value Function Factorisation

Segmentation Training Pipeline

This is project is the implementation of the DeepShift: Towards Multiplication-Less Neural Networks paper

Not Suitable for Work (NSFW) classification using deep neural network Caffe models.

PyTorch implementations of the NeRF model described in "NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis"

CROSS-LINGUAL ABILITY OF MULTILINGUAL BERT: AN EMPIRICAL STUDY

A pure PyTorch implementation of the loss described in "Online Segment to Segment Neural Transduction"