FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

Last update: Jan 07, 2023

Related tags

Overview

FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

FrankMocap pursues an easy-to-use single view 3D motion capture system developed by Facebook AI Research (FAIR). FrankMocap provides state-of-the-art 3D pose estimation outputs for body, hand, and body+hands in a single system. The core objective of FrankMocap is to democratize the 3D human pose estimation technology, enabling anyone (researchers, engineers, developers, artists, and others) can easily obtain 3D motion capture outputs from videos and images.

Btw, why the name FrankMocap? Our pipeline to integrate body and hand modules reminds us of Frankenstein's monster!

News:

[2020/10/09] We have improved openGL rendering speed. It's about 40% faster. (e.g., body module: 6fps -> 11fps)

Key Features

Body Motion Capture:

Hand Motion Capture

Egocentric Hand Motion Capture

Whole body Motion Capture (body + hands)

Installation

See INSTALL.md

A Quick Start

Run body motion capture

# using a machine with a monitor to show output on screen
python -m demo.demo_bodymocap --input_path ./sample_data/han_short.mp4 --out_dir ./mocap_output

# screenless mode (e.g., a remote server)
xvfb-run -a python -m demo.demo_bodymocap --input_path ./sample_data/han_short.mp4 --out_dir ./mocap_output

Run hand motion capture

# using a machine with a monitor to show outputs on screen
python -m demo.demo_handmocap --input_path ./sample_data/han_hand_short.mp4 --out_dir ./mocap_output

# screenless mode  (e.g., a remote server)
xvfb-run -a python -m demo.demo_handmocap --input_path ./sample_data/han_hand_short.mp4 --out_dir ./mocap_output

Run whole body motion capture

# using a machine with a monitor to show outputs on screen
python -m demo.demo_frankmocap --input_path ./sample_data/han_short.mp4 --out_dir ./mocap_output

# screenless mode  (e.g., a remote server)
xvfb-run -a python -m demo.demo_frankmocap --input_path ./sample_data/han_short.mp4 --out_dir ./mocap_output

Note:
- Above commands use openGL by default. If it does not work, you may try alternative renderers (pytorch3d or openDR).
- See the readme of each module for details

Joint Order

See joint_order

Body Motion Capture Module

See run_bodymocap

Hand Motion Capture Module

See run_handmocap

Whole Body Motion Capture Module (Body + Hand)

See run_totalmocap

License

CC-BY-NC 4.0. See the LICENSE file.

References

FrankMocap is based on the following research outputs:

@article{rong2020frankmocap,
  title={FrankMocap: Fast Monocular 3D Hand and Body Motion Capture by Regression and Integration},
  author={Rong, Yu and Shiratori, Takaaki and Joo, Hanbyul},
  journal={arXiv preprint arXiv:2008.08324},
  year={2020}
}

@article{joo2020eft,
  title={Exemplar Fine-Tuning for 3D Human Pose Fitting Towards In-the-Wild 3D Human Pose Estimation},
  author={Joo, Hanbyul and Neverova, Natalia and Vedaldi, Andrea},
  journal={arXiv preprint arXiv:2004.03686},
  year={2020}
}

FrankMocap leverages many amazing open-sources shared in research community.
- SMPL, SMPLX
- Detectron2
- Pytorch3D (for rendering)
- OpenDR (for rendering)
- SPIN (for body module)
- 100DOH (for hand detection)
- lightweight-human-pose-estimation (for body detection)

FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

Related tags

Overview

FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

News:

Key Features

Installation

A Quick Start

Joint Order

Body Motion Capture Module

Hand Motion Capture Module

Whole Body Motion Capture Module (Body + Hand)

License

References

Owner

Facebook Research

Improving Machine Translation Systems via Isotopic Replacement

Replication of Pix2Seq with Pretrained Model

[ACM MM 2021] Multiview Detection with Shadow Transformer (and View-Coherent Data Augmentation)

A series of Jupyter notebooks with Chinese comment that walk you through the fundamentals of Machine Learning and Deep Learning in python using Scikit-Learn and TensorFlow.

Official Repository of NeurIPS2021 paper: PTR

Equivariant CNNs for the sphere and SO(3) implemented in PyTorch

Some code of the implements of Geological Modeling Using 3D Pixel-Adaptive and Deformable Convolutional Neural Network

PyTorch implementation for View-Guided Point Cloud Completion

Deep ViT Features as Dense Visual Descriptors

[NeurIPS 2020] Code for the paper "Balanced Meta-Softmax for Long-Tailed Visual Recognition"

object recognition with machine learning on Respberry pi

A python3 tool to take a 360 degree survey of the RF spectrum (hamlib + rotctld + RTL-SDR/HackRF)

Code for Transformers Solve Limited Receptive Field for Monocular Depth Prediction

PyTorch implementation of NeurIPS 2021 paper: "CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration"

Towards End-to-end Video-based Eye Tracking

Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"

NeRD: Neural Reflectance Decomposition from Image Collections

Weakly supervised medical named entity classification

This repository contains the source code of our work on designing efficient CNNs for computer vision

CS550 Machine Learning course project on CNN Detection.