Python code to fuse multiple RGB-D images into a TSDF voxel volume.

Last update: Jan 03, 2023

Overview

Volumetric TSDF Fusion of RGB-D Images in Python

This is a lightweight Python script that fuses multiple registered color and depth images into a projective truncated signed distance function (TSDF) volume, which can then be used to create high-quality 3D surface meshes and point clouds. Tested on Ubuntu 16.04.
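To make the fusion step concrete, here is a minimal NumPy sketch of how one depth image could be integrated into a projective TSDF volume by weighted averaging. It is an illustration under stated assumptions, not the code shipped in this repository: the function name `integrate_depth`, its parameters, the camera-to-world pose convention, and the constant per-frame weight of 1 are all hypothetical choices made for the example.

```python
import numpy as np

def integrate_depth(tsdf, weight, depth, K, cam_pose, vol_origin, voxel_size, trunc):
    """Fuse one depth image (meters) into a TSDF volume, in place.

    Assumptions for this sketch: cam_pose is a 4x4 camera-to-world matrix,
    K is a 3x3 pinhole intrinsics matrix, and every observation has weight 1.
    """
    # World coordinates of every voxel (N x 3).
    grids = np.meshgrid(*[np.arange(n) for n in tsdf.shape], indexing="ij")
    vox = np.stack(grids, axis=-1).reshape(-1, 3)
    world = np.asarray(vol_origin, dtype=float) + voxel_size * vox

    # Move voxels into the camera frame.
    world_to_cam = np.linalg.inv(cam_pose)
    cam = world @ world_to_cam[:3, :3].T + world_to_cam[:3, 3]

    # Project voxels into the image with the pinhole model.
    z = cam[:, 2]
    valid = z > 0
    safe_z = np.where(valid, z, 1.0)  # avoid division by zero behind the camera
    u = np.round(K[0, 0] * cam[:, 0] / safe_z + K[0, 2]).astype(int)
    v = np.round(K[1, 1] * cam[:, 1] / safe_z + K[1, 2]).astype(int)
    h, w = depth.shape
    valid &= (u >= 0) & (u < w) & (v >= 0) & (v < h)

    # Projective signed distance: measured depth minus voxel depth.
    d = np.zeros_like(z)
    d[valid] = depth[v[valid], u[valid]]
    sdf = d - z
    # Skip missing depth and voxels far behind the observed surface.
    valid &= (d > 0) & (sdf >= -trunc)
    new_tsdf = np.minimum(1.0, sdf / trunc)

    # Running weighted average of the truncated distances.
    idx = tuple(vox[valid].T)
    w_old = weight[idx]
    tsdf[idx] = (w_old * tsdf[idx] + new_tsdf[valid]) / (w_old + 1.0)
    weight[idx] = w_old + 1.0
    return tsdf, weight

# Toy example: a flat wall 1 m in front of an identity-pose camera.
tsdf = np.ones((8, 8, 8))
weight = np.zeros((8, 8, 8))
K = np.array([[50.0, 0.0, 32.0], [0.0, 50.0, 32.0], [0.0, 0.0, 1.0]])
depth = np.full((64, 64), 1.0)
integrate_depth(tsdf, weight, depth, K, np.eye(4), [-0.4, -0.4, 0.5], 0.1, 0.3)
```

After the call, voxels on the wall hold values near zero, voxels in front of it hold positive values, and voxels just behind it hold negative values. A mesh can then be extracted at the zero crossing, for example with marching cubes.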

An older CUDA/C++ version can be found here.

Requirements

Python 2.7+ with NumPy, OpenCV, scikit-image and Numba (PyCUDA is needed only for GPU acceleration, see below). The required packages can be installed or updated by running the following:
```
pip install --user numpy opencv-python scikit-image numba
```
[Optional] GPU acceleration requires an NVIDIA GPU with CUDA and PyCUDA:
```
pip install --user pycuda
```

Demo

This demo fuses 1000 RGB-D images from the 7-scenes dataset into a 405 x 264 x 289 projective TSDF voxel volume with 2 cm resolution at about 30 FPS in GPU mode (0.4 FPS in CPU mode), and writes a 3D mesh to mesh.ply, which can be visualized with a 3D viewer such as MeshLab.

Note: color images are saved as 24-bit PNG RGB, depth images are saved as 16-bit PNG in millimeters.
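Because the depth PNGs store millimeters as 16-bit integers, they need to be scaled to meters before fusion. The sketch below shows one way to do that conversion; the helper name `depth_png_to_meters` is hypothetical, and the small array stands in for an image that would normally be read with `cv2.imread(path, cv2.IMREAD_UNCHANGED)` so that the 16-bit values are preserved.

```python
import numpy as np

def depth_png_to_meters(depth_raw):
    """Convert a 16-bit depth image in millimeters to float32 meters."""
    depth_raw = np.asarray(depth_raw)
    if depth_raw.dtype != np.uint16:
        # An 8-bit array usually means the PNG was read without IMREAD_UNCHANGED.
        raise ValueError("expected a 16-bit unsigned depth image")
    return depth_raw.astype(np.float32) / 1000.0

# Stand-in for a depth image loaded from disk (values in millimeters).
raw = np.array([[0, 500], [1000, 2500]], dtype=np.uint16)
depth_m = depth_png_to_meters(raw)  # 0.0, 0.5, 1.0 and 2.5 meters
```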

```
python demo.py
```
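As a quick sanity check on the volume size quoted for the demo, the following sketch relates the voxel grid dimensions and the 2 cm resolution to the physical extent and memory footprint. The helper `volume_stats` is hypothetical, and the 4 bytes per voxel figure assumes one float32 array; an implementation that also stores weights and colors would need several such arrays.

```python
def volume_stats(dims, voxel_size_m, bytes_per_voxel=4):
    """Return (extent in meters, voxel count, bytes) for a dense voxel grid."""
    nx, ny, nz = dims
    # Physical size along each axis, rounded to hide floating-point noise.
    extent_m = tuple(round(n * voxel_size_m, 6) for n in dims)
    n_voxels = nx * ny * nz
    return extent_m, n_voxels, n_voxels * bytes_per_voxel

extent_m, n_voxels, n_bytes = volume_stats((405, 264, 289), 0.02)
print(extent_m)   # (8.1, 5.28, 5.78)
print(n_voxels)   # 30899880
print(n_bytes)    # 123599520
```

So the demo volume spans roughly 8.1 m x 5.3 m x 5.8 m and holds about 31 million voxels, which is around 124 MB per float32 array.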

Seen In

References

Citing

This repository is a part of 3DMatch Toolbox. If you find this code useful in your work, please consider citing:

```
@inproceedings{zeng20163dmatch,
    title={3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions},
    author={Zeng, Andy and Song, Shuran and Nie{\ss}ner, Matthias and Fisher, Matthew and Xiao, Jianxiong and Funkhouser, Thomas},
    booktitle={CVPR},
    year={2017}
}
```

Owner

Andy Zeng
