Official code repository for ICCV 2021 paper: Gravity-Aware Monocular 3D Human Object Reconstruction

Last update: Dec 09, 2022

Related tags

Overview

GraviCap

Official code repository for ICCV 2021 paper: Gravity-Aware Monocular 3D Human Object Reconstruction.

Gravity-Aware Monocular 3D Human-Object Reconstruction
Rishabh Dabral, Soshi Shimada, Arjun Jain, Christian Theobalt, Vladislav Golyanik
International Conference on Computer Vision (ICCV) 2021

We propose GraviCap - a new approach for joint 3D human-object reconstruction under gravity constraints. Given the 2D trajectory of an object and intrinsics, we recover the 3D trajectory of the object and the human in absolute metric units along with the camera tilt. The trajectories are recovered in absolute metric units while also estimating the camera tilt. We also release a new dataset with human and object annotations as a benchmark. This repository comes with a subset of the dataset for demo purposes. Please download the full dataset from the project page.

Installation

conda create --name gravicap python=3.6
git clone https://github.com/rishabhdabral/gravicap.git
pip install scipy

For running with VNect outputs, you may need to get access to VNect by registering here.

Quick Start

Demo on S6: We provide processed data of Sequence 6 from the Gravicap dataset.

python main.py --calib_path ./data/calibration_hps.pkl --annot_dir ./data/ --cam 17 --eps 0 --gt_pose_path ./data/S6.mddd 
# Use the --cam and --eps flags to set the camera and episode ids.

If VNect outputs for are available:

python main.py --calib_path ./data/calibration_hps.pkl --annot_dir ./data/ --cam 17 --eps 0 --gt_pose_path ./data/S6.mddd --mode vnect --pose_path path/to/vnect/output/directory
# Use the --cam and --eps flags to set the camera and episode ids.

License

Permission is hereby granted, free of charge, to any person or company obtaining a copy of this software and associated documentation files (the "Software") from the copyright holders to use the Software for any non-commercial purpose. Publication, redistribution and (re)selling of the software, of modifications, extensions, and derivates of it, and of other software containing portions of the licensed Software, are not permitted. The Copyright holder is permitted to publically disclose and advertise the use of the software by any licensee.

Packaging or distributing parts or whole of the provided software (including code, models and data) as is or as part of other software is prohibited. Commercial use of parts or whole of the provided software (including code, models and data) is strictly prohibited. Using the provided software for promotion of a commercial entity or product, or in any other manner which directly or indirectly results in commercial gains is strictly prohibited.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

References

@inproceedings{GraviCap2021, 
    author = {Dabral, Rishabh and Shimada, Soshi and Jain, Arjun and Theobalt, Christian and Golyanik, Vladislav}, 
    title = {Gravity-Aware Monocular 3D Human-Object Reconstruction}, 
    booktitle = {International Conference on Computer Vision (ICCV)}, 
    year = {2021} 
}

Official code repository for ICCV 2021 paper: Gravity-Aware Monocular 3D Human Object Reconstruction

Related tags

Overview

GraviCap

Installation

Quick Start

License

References

Owner

Rishabh Dabral

Caffe: a fast open framework for deep learning.

[NeurIPS'21] Projected GANs Converge Faster

A PyTorch implementation of Sharpness-Aware Minimization for Efficiently Improving Generalization

Simple tools for logging and visualizing, loading and training

Development kit for MIT Scene Parsing Benchmark

Links to works on deep learning algorithms for physics problems, TUM-I15 and beyond

A curated list of resources for Image and Video Deblurring

Forecasting with Gradient Boosted Time Series Decomposition

Deep Distributed Control of Port-Hamiltonian Systems

Unsupervised 3D Human Mesh Recovery from Noisy Point Clouds

Deep Learning applied to Integral data analysis

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Welcome to The Eigensolver Quantum School, a quantum computing crash course designed by students for students.

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with ONNX, TensorRT, ncnn, and OpenVINO supported.

Novel and high-performance medical image classification pipelines are heavily utilizing ensemble learning strategies

A Pythonic library for Nvidia Codec.

Resources related to our paper "CLIN-X: pre-trained language models and a study on cross-task transfer for concept extraction in the clinical domain"

[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

(CVPR 2021) Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

ESL: Event-based Structured Light