OpenCV, MediaPipe Pose Estimation, Affine Transform for Icon Overlay

Last update: Dec 03, 2021

Overview

Yoga Pose Identification and Icon Matching

Project Goal

Detect yoga poses performed by a user and overlay a corresponding icon image. Running the main script starts the video stream with automatic pose detection.

Part 1: Pose Detection

I use the 33 body landmarks provided by MediaPipe to measure joint angles, then identify each yoga pose from the joint angles that characterize it. For example, in the star pose, the angle between the shoulder, elbow, and wrist landmarks (elbow flexion) is below 20 degrees, and the angle between the elbow, shoulder, and opposite shoulder (shoulder flexion) is also below 20 degrees.
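A minimal sketch of the joint-angle check described above. It assumes that each landmark is an (x, y) pair and that "flexion" means the deviation of the joint from a straight line (180 degrees), which is one reading of the 20-degree threshold; the function names and the sample coordinates are hypothetical and not taken from the project code.

```python
import math

def joint_angle(a, b, c):
    """Return the interior angle at vertex b, in degrees, for points a-b-c."""
    v1 = (a[0] - b[0], a[1] - b[1])
    v2 = (c[0] - b[0], c[1] - b[1])
    cross = v1[0] * v2[1] - v1[1] * v2[0]
    dot = v1[0] * v2[0] + v1[1] * v2[1]
    # atan2 of cross and dot gives the signed angle; abs() folds it into 0..180
    return abs(math.degrees(math.atan2(cross, dot)))

def flexion(a, b, c):
    """Deviation of the joint at b from a straight line (assumed definition)."""
    return 180.0 - joint_angle(a, b, c)

def is_star_arm(opposite_shoulder, shoulder, elbow, wrist, threshold=20.0):
    """Check one arm against the star-pose rule: both flexions under threshold."""
    elbow_flexion = flexion(shoulder, elbow, wrist)
    shoulder_flexion = flexion(elbow, shoulder, opposite_shoulder)
    return elbow_flexion < threshold and shoulder_flexion < threshold

# Hypothetical normalized landmark coordinates for an arm held out sideways
straight_arm = is_star_arm((0.4, 0.3), (0.6, 0.3), (0.75, 0.3), (0.9, 0.31))
# Same arm with the forearm pointing straight up (elbow bent 90 degrees)
bent_arm = is_star_arm((0.4, 0.3), (0.6, 0.3), (0.75, 0.3), (0.75, 0.1))
```

A full pose check would presumably repeat this for both arms and add any leg conditions, one rule set per pose.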

Part 2: Icon Image Transformation

To transform the icon image that will be overlaid on the user, I first preprocess the icon image, then apply an affine transform. To preprocess the icon, I resize it to roughly the same height as the user, a metric also calculated from MediaPipe's landmarks. I then add a border to the icon image so that its image array has the same dimensions as the video stream frames. These steps help make the affine transform more effective. I select three key pose landmarks for each pose, then find three key points on the icon that should match these points. For example, I chose to match the nose and ankles of the person with the top tip and bottom two tips of the star.
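The border and three-point matching steps can be sketched with NumPy alone. The matrix solved for here is the same 2x3 matrix that OpenCV's cv2.getAffineTransform returns for three point pairs, and it would then be passed to cv2.warpAffine to warp the padded icon. The helper names, the black border, the centred placement, and all coordinates are assumptions for illustration; the icon is assumed to already be resized so that it fits inside the frame.

```python
import numpy as np

def pad_to_frame(icon, frame_h, frame_w):
    """Centre the icon on a black canvas with the frame's height and width."""
    h, w = icon.shape[:2]
    canvas = np.zeros((frame_h, frame_w) + icon.shape[2:], dtype=icon.dtype)
    top = (frame_h - h) // 2
    left = (frame_w - w) // 2
    canvas[top:top + h, left:left + w] = icon
    # The offset is needed to shift icon key points into canvas coordinates
    return canvas, (left, top)

def affine_from_three_points(src_pts, dst_pts):
    """Solve for the 2x3 matrix M with M @ [x, y, 1] == (x', y') for each pair."""
    src = np.asarray(src_pts, dtype=np.float64)
    dst = np.asarray(dst_pts, dtype=np.float64)
    # One row [x, y, 1] per source point; the points must not be collinear
    A = np.hstack([src, np.ones((3, 1))])
    # A @ M.T = dst, so solve the 3x3 system and transpose the result
    return np.linalg.solve(A, dst).T

# Hypothetical 100x100 star icon padded to a 480x640 frame
icon = np.full((100, 100, 3), 255, dtype=np.uint8)
padded, (dx, dy) = pad_to_frame(icon, 480, 640)

# Star tips in icon coordinates (x, y), shifted by the padding offset
star_tips = [(50 + dx, 0 + dy), (10 + dx, 99 + dy), (90 + dx, 99 + dy)]
# Hypothetical pixel positions of the nose, left ankle, and right ankle
body_points = [(300, 100), (220, 420), (400, 430)]

M = affine_from_three_points(star_tips, body_points)
```

Because three non-collinear point pairs determine an affine map exactly, the star tips land precisely on the chosen landmarks; every other icon pixel is carried along by the same rotation, scale, shear, and translation.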

Part 3: Image Overlay

I overlaid just the icon pixels (the icon background is ignored) by summing 0.5 of the icon pixel value with 0.5 of the video frame value, resulting in a semi-transparent overlay of just the icon.
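A rough sketch of the masked 50/50 blend, assuming the transformed icon and the frame are uint8 arrays of the same shape and that the icon background is pure black. The function name and the background convention are assumptions. OpenCV's cv2.addWeighted would blend the whole image, background included, so the mask is handled with NumPy here.

```python
import numpy as np

def blend_icon(frame, icon, alpha=0.5):
    """Blend icon into frame only where the icon has non-background pixels."""
    # Assumption: a pixel is background when all of its channels are zero
    mask = icon.any(axis=2)
    out = frame.copy()
    # Work in float to avoid uint8 overflow, then convert back
    blended = (alpha * icon[mask].astype(np.float32)
               + (1.0 - alpha) * frame[mask].astype(np.float32))
    out[mask] = np.round(blended).astype(frame.dtype)
    return out

# Tiny example: a grey frame and an icon with a single red-ish pixel
frame = np.full((3, 3, 3), 100, dtype=np.uint8)
icon = np.zeros((3, 3, 3), dtype=np.uint8)
icon[1, 1] = (200, 0, 0)

result = blend_icon(frame, icon)
```

Only the pixel at row 1, column 1 changes; every background position keeps the original frame value.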

Results

Star Pose

Tree Pose

Chair Pose

Owner

Anna Garverick
