Keras implementation of PersonLab for Multi-Person Pose Estimation and Instance Segmentation.

Last update: Dec 21, 2022

Overview

PersonLab

This is a Keras implementation of PersonLab for Multi-Person Pose Estimation and Instance Segmentation. The model predicts keypoint heatmaps and several offset fields, from which joint locations, joint connections, and per-pixel instance IDs are computed. See the paper for more details.
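To illustrate how a heatmap and an offset field can combine into a joint location, here is a minimal NumPy sketch. It is a simplification and not the decoding used in this repository: it takes the heatmap peak for a single keypoint channel and refines it with the offset predicted at that pixel. The function name `decode_keypoint` and the array layouts are assumptions made for the example.

```python
import numpy as np

def decode_keypoint(heatmap, offsets):
    """Return (x, y, score) for one keypoint type.

    heatmap: (H, W) array of keypoint scores (assumed layout).
    offsets: (H, W, 2) array of (dx, dy) vectors pointing from each
             pixel toward the keypoint (assumed layout).
    """
    # Coarse location: the pixel with the highest score.
    y, x = np.unravel_index(np.argmax(heatmap), heatmap.shape)
    # Refinement: add the offset vector predicted at that pixel.
    dx, dy = offsets[y, x]
    return float(x + dx), float(y + dy), float(heatmap[y, x])
```

This handles a single person only; separating multiple people requires the additional grouping steps described in the paper.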

Training a model

If you want to use ResNet-101 as the base network, first download the ImageNet initialization weights from here and copy them to your ~/.keras/models/ directory. (Files over 100 MB cannot be hosted on GitHub.)
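The copy step can also be scripted. The helper below is a small sketch; `install_weights` is a hypothetical name and not part of this repository, and the weights filename depends on the file you downloaded.

```python
import os
import shutil

def install_weights(src_path, models_dir=None):
    """Copy a downloaded weights file into the Keras models directory."""
    if models_dir is None:
        # Default location where Keras looks for cached model files.
        models_dir = os.path.join(os.path.expanduser("~"), ".keras", "models")
    if not os.path.isdir(models_dir):
        os.makedirs(models_dir)
    dst_path = os.path.join(models_dir, os.path.basename(src_path))
    shutil.copyfile(src_path, dst_path)
    return dst_path
```

Call it with the path to the downloaded file, for example `install_weights("/path/to/downloaded_weights.h5")`, where the path is a placeholder.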

Build the dataset in the expected format by running the generate_hdf5.py script. Before running it, set the ANNO_FILE and IMG_DIR constants at the top of the script to the paths of the COCO person_keypoints annotation file and the image folder, respectively.
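The two constants might be set as follows. The paths are placeholders for a typical COCO 2017 layout and are assumptions; `missing_paths` is an optional helper, not part of generate_hdf5.py, that catches a wrong path before the script starts.

```python
import os

# Placeholder paths: replace with the locations on your machine.
ANNO_FILE = "/data/coco/annotations/person_keypoints_train2017.json"
IMG_DIR = "/data/coco/train2017/"

def missing_paths(anno_file, img_dir):
    """Return a list of problems; the list is empty if both paths exist."""
    problems = []
    if not os.path.isfile(anno_file):
        problems.append("annotation file not found: " + anno_file)
    if not os.path.isdir(img_dir):
        problems.append("image folder not found: " + img_dir)
    return problems

for problem in missing_paths(ANNO_FILE, IMG_DIR):
    print(problem)
```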

Edit config.py to set training options such as the input resolution, the number of GPUs, and whether to freeze the batchnorm weights. More advanced options require altering the train.py script. For example, the base network can be changed by passing an argument to the get_personlab() function; see the documentation there.

After everything is configured to your liking, run the train.py script.

Testing a model

See demo.ipynb for sample inference and visualizations.

Technical Debts

Several parts of this codebase are borrowed from others. These include:

The Resnet-101 in Keras
The augmentation code (which differs from the procedure in the PersonLab paper) and the data iterator code are heavily borrowed from this fork of the Keras implementation of CMU's "Realtime Multi-Person Pose Estimation". (The pose plotting function is also influenced by the one in that repo.)
The Polyak Averaging callback is a lightly modified version of the EMA callback from here.
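For readers unfamiliar with Polyak averaging, the idea is to keep an exponential moving average of the model weights alongside the trained weights and to use the average at test time. The NumPy sketch below shows only the update rule; the function name `ema_update` and the decay values are assumptions, and it is not the callback used in this repository.

```python
import numpy as np

def ema_update(shadow, weights, decay=0.999):
    """Blend the current weights into the running average, array by array."""
    return [decay * s + (1.0 - decay) * w for s, w in zip(shadow, weights)]

# Toy example: one weight array, averaged over two training steps.
shadow = [np.zeros(2)]
for step_weights in ([np.array([1.0, 2.0])], [np.array([1.0, 2.0])]):
    shadow = ema_update(shadow, step_weights, decay=0.5)
```

A decay close to 1 makes the average change slowly, which smooths out step-to-step noise in the weights.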

Environment

This code was tested in the following environment and with the following software versions:

Ubuntu 16.04
CUDA 8.0 with cuDNN 6.0
Python 2.7
TensorFlow 1.7
Keras 2.1.3
OpenCV 2.4.9

Owner

OCTI
