A learning-based data collection tool for human segmentation

Last update: Jun 24, 2022

FullBodyFilter

Overview

Human segmentation is the difficult machine learning task of identifying and extracting the human in a picture. It is most often done with a convolutional neural network. Achieving an accurate and robust model requires large amounts of training data covering varied human poses, and collecting and labeling that data by hand takes a lot of time and resources. This project explores an alternative: using automation to collect and label pre-existing data from internet videos.

The target model is the DTEN ME model, which is used for virtual backgrounds in Zoom meetings.

OpenPose is used to filter the video for suitable frames, in particular frames that show a single person's full body. Mask R-CNN serves as the teacher model that generates the training labels. To find the images on which the ME model performs poorly, the ME masks are compared against the Mask R-CNN masks. The result is a set of images and masks that can be used as training data.
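The frame filter and the mask comparison described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code. It assumes keypoints arrive in the OpenPose BODY_25 layout as an array of shape (num_people, 25, 3) holding (x, y, confidence), and that both masks are binary arrays of the same size. The function names, the choice of joints that define "full body", the confidence cutoff, and the IoU threshold are all hypothetical; the scripts documented in doc may use different criteria.

```python
import numpy as np

# Assumed BODY_25 indices: nose, neck, mid-hip, right ankle, left ankle.
# Requiring these joints is an illustrative definition of "full body".
FULL_BODY_JOINTS = (0, 1, 8, 11, 14)


def is_single_full_body(pose_keypoints, min_conf=0.3):
    """Return True if exactly one person is detected, head to ankles.

    pose_keypoints: array of shape (num_people, 25, 3) with
    (x, y, confidence) per joint, or None when nobody is detected.
    min_conf is a hypothetical cutoff, not a value from the project.
    """
    if pose_keypoints is None:
        return False
    kp = np.asarray(pose_keypoints, dtype=float)
    # Reject empty detections and frames with more than one person.
    if kp.ndim != 3 or kp.shape[0] != 1:
        return False
    conf = kp[0, list(FULL_BODY_JOINTS), 2]
    return bool(np.all(conf >= min_conf))


def mask_iou(mask_a, mask_b):
    """Intersection over union of two binary masks of equal shape."""
    a = np.asarray(mask_a, dtype=bool)
    b = np.asarray(mask_b, dtype=bool)
    union = np.logical_or(a, b).sum()
    if union == 0:
        return 1.0  # both masks empty: treat as full agreement
    return float(np.logical_and(a, b).sum() / union)


def is_hard_example(teacher_mask, student_mask, iou_threshold=0.9):
    """Flag a frame where the student mask disagrees with the teacher.

    teacher_mask stands in for the Mask R-CNN output and student_mask
    for the ME output; iou_threshold is an assumed value.
    """
    return mask_iou(teacher_mask, student_mask) < iou_threshold
```

Under these assumptions, a frame would be saved as training data only when `is_single_full_body` accepts it and `is_hard_example` reports that the two masks disagree. IoU is used here as one common way to compare masks; the README does not state which comparison metric the project uses.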

Overview of Program

A full report of the system design and implementation details can be found in doc.

Sample Results

Examples of saved training data. In each image, the bottom left is the Mask R-CNN mask and the bottom right is the ME mask.

Usage

This project relies on OpenPose and Mask R-CNN and all of their dependencies. Instructions on how to set up each are found in their respective directories here.

Documentation on how to use the scripts is located in doc.

Owner

Robert Jiang
