OcclusionFusion: realtime dynamic 3D reconstruction based on single-view RGB-D

Last update: Dec 15, 2022

Overview

OcclusionFusion (CVPR'2022)

Project Page | Paper | Video

Overview

This repository contains the code for the CVPR 2022 paper OcclusionFusion, where we introduce a novel method to calculate occlusion-aware 3D motion to guide dynamic 3D reconstruction.

In our technique, the motion of visible regions is first estimated and combined with temporal information to infer the motion of the occluded regions through an LSTM-involved graph neural network.

Currently, we provide a pretrained model and a demo. Code for data pre-processing, network training and evaluation will be available soon.

Setup

We use python 3.8.10, pytorch-1.8.0 and pytorch-geometric-1.7.2.

conda create -n occlusionfu python==3.8.10
conda activate occlusionfu
pip install -r requirements.txt
conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=10.2 -c pytorch
pip install torch-scatter==2.0.8 -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html
pip install torch-sparse==0.6.12 -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html
pip install torch-cluster==1.5.9 -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html
pip install torch-spline-conv==1.2.1 -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html
pip install torch-geometric==1.7.2

Running the demo

Run the demo with the pretrained model and prepared inputs:

python demo.py

Visualize the input and output:

python visualize.py

The defualt setting of visualize.py will render the network's input and output to a video as follow. You can also change the setting to view the network's input and output with Open3D viewer.

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{lin2022occlusionfusion,
    title={OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction}, 
    author={Wenbin Lin, Chengwei Zheng, Jun-Hai Yong, Feng Xu}, 
    journal={Conference on Computer Vision and Pattern Recognition (CVPR)}, 
    year={2022}
}

OcclusionFusion: realtime dynamic 3D reconstruction based on single-view RGB-D

Related tags

Overview

OcclusionFusion (CVPR'2022)

Project Page | Paper | Video

Overview

Setup

Running the demo

Citation

Owner

Wenbin Lin

Pytorch Implementation of Adversarial Deep Network Embedding for Cross-Network Node Classification

Repository containing detailed experiments related to the paper "Memotion Analysis through the Lens of Joint Embedding".

The repository includes the code for training cell counting applications. (Keras + Tensorflow)

How Do Adam and Training Strategies Help BNNs Optimization? In ICML 2021.

Official implementation for "Low-light Image Enhancement via Breaking Down the Darkness"

This repository contains the official implementation code of the paper Transformer-based Feature Reconstruction Network for Robust Multimodal Sentiment Analysis

A python script to lookup Passport Index Dataset

This is code of book "Learn Deep Learning with PyTorch"

Official PyTorch Implementation of Learning Architectures for Binary Networks

Code for "FPS-Net: A convolutional fusion network for large-scale LiDAR point cloud segmentation".

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.

PyTorch implementation of paper "Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes", CVPR 2021

SeqAttack: a framework for adversarial attacks on token classification models

We will see a basic program that is basically a hint to brute force attack to crack passwords. In other words, we will make a program to Crack Any Password Using Python. Show some ❤️ by starring this repository!

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

QT Py Media Knob using rotary encoder & neopixel ring

You Only Hypothesize Once: Point Cloud Registration with Rotation-equivariant Descriptors

Official repository for GCR rerank, a GCN-based reranking method for both image and video re-ID

A Broad Study on the Transferability of Visual Representations with Contrastive Learning

Trafffic prediction analysis using hybrid models - Machine Learning