KinectFusion implemented in Python with PyTorch

Last update: Jan 03, 2023

Overview

KinectFusion implemented in Python with PyTorch

This is a lightweight Python implementation of KinectFusion. All the core functions (TSDF volume, frame-to-model tracking, point-to-plane ICP, raycasting, TSDF fusion, etc.) are implemented using pure PyTorch, i.e. no custom CUDA kernels.

Although without any custom CUDA functions, the system could still run at a fairly fast speed: The demo reconstructs the TUM fr1_desk sequence into a 225 x 171 x 111 TSDF volume with 2cm resolution at round 17 FPS with a single RTX-2080 GPU (~1.5 FPS in CPU mode)

Note that this project is mainly for study purpose, and is not fully optimized for accurate camera tracking.

Requirements

The core functionalities were implemented in PyTorch (1.10). Open3D (0.14.0) is used for visualisation. Other important dependancies include:

numpy==1.21.2
opencv-python==4.5.5
imageio==2.14.1
scikit-image==0.19.1
trimesh==3.9.43

You can create an anaconda environment called kinfu with the required dependencies by running:

conda env create -f environment.yml
conda activate kinfu

Data Preparation

The code was tested on TUM dataset. After downloading the raw sequences, you will need to run the pre-processing script under dataset/. For example:

python dataset/preprocess.py --config configs/fr1_desk.yaml

There are some example config files under configs/ which correspond to different sequences. You need to replace data_root to your own sequence directory before running the script. After running the script a new directory processed/ will appear under your sequence directory.

Run

After obtaining the processed sequence, you can simply run kinfu.py. For example:

python kinfu.py --config configs/fr1_desk.yaml --save_dir reconstruct/fr1_desk

which will perform the tracking and mapping headlessly and save the results. Or you could run:

python kinfu_gui.py --config configs/fr1_desk.yaml

If you want to visualize the tracking and reconstruction process on-the-fly.

Acknowledgement

Part of the tracking code was borrowed and modified from DeepIC. Also thank Binbin Xu for implementing part of the TSDF volume code which is inspired by Andy Zeng's tsdf-fusion-python.

KinectFusion implemented in Python with PyTorch

Related tags

Overview

KinectFusion implemented in Python with PyTorch

Requirements

Data Preparation

Run

Acknowledgement

Owner

Jingwen Wang

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

A Deep Learning Framework for Neural Derivative Hedging

Benchmark library for high-dimensional HPO of black-box models based on Weighted Lasso regression

Very large and sparse networks appear often in the wild and present unique algorithmic opportunities and challenges for the practitioner

Using this codebase as a tool for my own research. Making some modifications to the original repo for my own purposes.

Hack Camera, Microphone, Location, Clipboard With Just a Link. Also, Get Many Details About Victim's Device. And So On...

PyTorch implementation of SQN based on CloserLook3D's encoder

A simple editor for captions in .SRT file extension

DrQ-v2: Improved Data-Augmented Reinforcement Learning

Qcover is an open source effort to help exploring combinatorial optimization problems in Noisy Intermediate-scale Quantum(NISQ) processor.

Official code for article "Expression is enough: Improving traﬀic signal control with advanced traﬀic state representation"

The story of Chicken for Club Bing

[TIP 2021] SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction

NeurIPS 2021 Datasets and Benchmarks Track

Code repository for paper `Skeleton Merger: an Unsupervised Aligned Keypoint Detector`.

How to Learn a Domain Adaptive Event Simulator? ACM MM, 2021

TensorFlow Implementation of Unsupervised Cross-Domain Image Generation

Single Image Random Dot Stereogram for Tensorflow

LSTM and QRNN Language Model Toolkit for PyTorch

Multilingual Image Captioning