[CVPR'22] COAP: Learning Compositional Occupancy of People

Last update: Dec 11, 2022

Related tags

Deep Learning COAP

Overview

COAP: Compositional Articulated Occupancy of People

Paper | Video | Project Page

This is the official implementation of the CVPR 2022 paper COAP: Learning Compositional Occupancy of People.

Description

This repository provides the official implementation of an implicit human body model (COAP) which implements efficient loss terms for resolving self-intersection and collisions with 3D geometries.

Installation

The necessary requirements are specified in the requrements.txt file. To install COAP, execute:

pip install git+https://github.com/markomih/COAP.git

Note that Pytorch3D may require manuall installation (see instructions here). Alternatively, we provide a conda environment file to install the dependences:

conda env create -f environment.yml
conda activate coap
pip install git+https://github.com/markomih/COAP.git

Optional Dependencies

Install the pyrender package to use the visualization/tutorial scripts and follow the additional instructions specified here if you wish to retrain COAP.

Tutorials

COAP extends the interface of the SMPL-X package (follow its instructions for the usage) via two volumetric loss terms: 1) a loss for resolving self-intersections and 2) a loss for resolving collisions with 3D geometries flexibly represented as point clouds. In the following, we provide a minimal interface to access the COAP's functionalities:

import smplx
from coap import attach_coap

# create a SMPL body and extend the SMPL body via COAP (we support: smpl, smplh, and smplx model types)
model = smplx.create(**smpl_parameters)
attach_coap(model)

smpl_output = model(**smpl_data)  # smpl forward pass
# NOTE: make sure that smpl_output contains the valid SMPL variables (pose parameters, joints, and vertices). 
assert model.joint_mapper is None, 'COAP requires valid SMPL joints as input'

# access two loss functions
model.coap.selfpen_loss(smpl_output)  # self-intersections
model.coap.collision_loss(smpl_output, scan_point_cloud)  # collisions with other geometris

Additionally, we provide two tutorials on how to use these terms to resolve self-intersections and collisions with the environment.

Pretrained Models

A respective pretrained model will be automatically fetched and loaded. All the pretrained models are available on the dev branch inside the ./models directory.

Citation

@inproceedings{Mihajlovic:CVPR:2022,
   title = {{COAP}: Compositional Articulated Occupancy of People},
   author = {Mihajlovic, Marko and Saito, Shunsuke and Bansal, Aayush and Zollhoefer, Michael and Tang, Siyu},
   booktitle = {Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR)},
   month = jun,
   year = {2022}
}

Contact

For questions, please contact Marko Mihajlovic ([email protected]) or raise an issue on GitHub.

[CVPR'22] COAP: Learning Compositional Occupancy of People

Related tags

Overview

COAP: Compositional Articulated Occupancy of People

Description

Installation

Optional Dependencies

Tutorials

Pretrained Models

Citation

Contact

Owner

Marko Mihajlovic

Implementation of CVPR'21: RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction

Quick program made to generate alpha and delta tables for Hidden Markov Models

Robotics environments

A lightweight Python-based 3D network multi-agent simulator. Uses a cell-based congestion model. Calculates risk, loudness and battery capacities of the agents. Suitable for 3D network optimization tasks.

Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning

A small demonstration of using WebDataset with ImageNet and PyTorch Lightning

A framework for multi-step probabilistic time-series/demand forecasting models

Dilated Convolution for Semantic Image Segmentation

A GOOD REPRESENTATION DETECTS NOISY LABELS

Vrcwatch - Supply the local time to VRChat as Avatar Parameters through OSC

Versatile Generative Language Model

An open-source project for applying deep learning to medical scenarios

Raindrop strategy for Irregular time series

It is an open dataset for object detection in remote sensing images.

VQMIVC - Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion

A facial recognition doorbell system using a Raspberry Pi

Code for the paper 'A High Performance CRF Model for Clothes Parsing'.

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

A script that trains a model to recognize handwritten digits using the MNIST data set.

Simple-Neural-Network From Scratch in Python