Monocular 3D pose estimation. OpenVINO. CPU inference or iGPU (OpenCL) inference.

Last update: Oct 03, 2022

Overview

human-pose-estimation-3d-python-cpp

RealSenseD435 (RGB) 480x640 + CPU Corei9 45 FPS (Depth is not used)

1. Run

1-1. RealSenseD435 (RGB) 480x640 + CPU Corei9 45 FPS (Depth is not used)

$ xhost +local: && \
docker run -it --rm \
-v `pwd`:/home/user/workdir \
-v /tmp/.X11-unix/:/tmp/.X11-unix:rw \
--device /dev/video0:/dev/video0:mwr \
--device /dev/video1:/dev/video1:mwr \
--device /dev/video2:/dev/video2:mwr \
--device /dev/video3:/dev/video3:mwr \
--device /dev/video4:/dev/video4:mwr \
--device /dev/video5:/dev/video5:mwr \
--net=host \
-e XDG_RUNTIME_DIR=$XDG_RUNTIME_DIR \
-e DISPLAY=$DISPLAY \
--privileged \
ghcr.io/pinto0309/openvino2tensorflow:latest

$ python3 human_pose_estimation_3d_demo.py \
--model models/openvino/FP16/human-pose-estimation-3d-0001_bgr_480x640.xml \
--device CPU \
--input 4

1-2. RealSenseD435 (RGB) 480x640 + iGPU (OpenCL)

$ xhost +local: && \
docker run -it --rm \
-v `pwd`:/home/user/workdir \
-v /tmp/.X11-unix/:/tmp/.X11-unix:rw \
--device /dev/video0:/dev/video0:mwr \
--device /dev/video1:/dev/video1:mwr \
--device /dev/video2:/dev/video2:mwr \
--device /dev/video3:/dev/video3:mwr \
--device /dev/video4:/dev/video4:mwr \
--device /dev/video5:/dev/video5:mwr \
--net=host \
-e LIBVA_DRIVER_NAME=iHD \
-e XDG_RUNTIME_DIR=$XDG_RUNTIME_DIR \
-e DISPLAY=$DISPLAY \
--privileged \
ghcr.io/pinto0309/openvino2tensorflow:latest

$ python3 human_pose_estimation_3d_demo.py \
--model models/openvino/FP16/human-pose-estimation-3d-0001_bgr_480x640.xml \
--device GPU \
--input 4

1-3. General USB Camera 480x640 + CPU

$ xhost +local: && \
docker run -it --rm \
-v `pwd`:/home/user/workdir \
-v /tmp/.X11-unix/:/tmp/.X11-unix:rw \
--device /dev/video0:/dev/video0:mwr \
--net=host \
-e XDG_RUNTIME_DIR=$XDG_RUNTIME_DIR \
-e DISPLAY=$DISPLAY \
--privileged \
ghcr.io/pinto0309/openvino2tensorflow:latest

$ python3 human_pose_estimation_3d_demo.py \
--model models/openvino/FP16/human-pose-estimation-3d-0001_bgr_480x640.xml \
--device CPU \
--input 0

2. Build

$ PYTHON_PREFIX=$(python3 -c "import sys; print(sys.prefix)") \
&& PYTHON_VERSION=$(python3 -c "import sys; print(f'{sys.version_info.major}.{sys.version_info.minor}')") \
&& PYTHON_INCLUDE_DIRS=${PYTHON_PREFIX}/include/python${PYTHON_VERSION}

$ NUMPY_INCLUDE_DIR=$(python3 -c "import numpy; print(numpy.get_include())")

$ mkdir -p pose_extractor/build && cd pose_extractor/build

$ cmake \
-DPYTHON_INCLUDE_DIRS=${PYTHON_INCLUDE_DIRS} \
-DNUMPY_INCLUDE_DIR=${NUMPY_INCLUDE_DIR} ..

$ make && cp pose_extractor.so ../.. && cd ../..

Monocular 3D pose estimation. OpenVINO. CPU inference or iGPU (OpenCL) inference.

Related tags

Overview

human-pose-estimation-3d-python-cpp

1. Run

1-1. RealSenseD435 (RGB) 480x640 + CPU Corei9 45 FPS (Depth is not used)

1-2. RealSenseD435 (RGB) 480x640 + iGPU (OpenCL)

1-3. General USB Camera 480x640 + CPU

2. Build

3. Reference

Owner

Katsuya Hyodo

Stochastic gradient descent with model building

This repository contains the exercises and its solution contained in the book "An Introduction to Statistical Learning" in python.

CUP-DNN is a deep neural network model used to predict tissues of origin for cancers of unknown of primary.

CVPRW 2021: How to calibrate your event camera

Official implementation of the MM'21 paper Constrained Graphic Layout Generation via Latent Optimization

A hybrid SOTA solution of LiDAR panoptic segmentation with C++ implementations of point cloud clustering algorithms. ICCV21, Workshop on Traditional Computer Vision in the Age of Deep Learning

[CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

DiAne is a smart fuzzer for IoT devices

A variational Bayesian method for similarity learning in non-rigid image registration (CVPR 2022)

An imperfect information game is a type of game with asymmetric information

Semantic segmentation task for ADE20k & cityscapse dataset, based on several models.

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Learned model to estimate number of distinct values (NDV) of a population using a small sample.

Joint Channel and Weight Pruning for Model Acceleration on Mobile Devices

Repo for 2021 SDD assessment task 2, by Felix, Anna, and James.

Contains code for the paper "Vision Transformers are Robust Learners".

FIRM-AFL is the first high-throughput greybox fuzzer for IoT firmware.

Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, CVPR 2019 (Oral)