Official implementation of NeurIPS'2021 paper TransformerFusion

Last update: Dec 25, 2022

Related tags

Overview

TransformerFusion: Monocular RGB Scene Reconstruction using Transformers

Project Page | Paper | Video

TransformerFusion: Monocular RGB Scene Reconstruction using Transformers
Aljaz Bozic, Pablo Palafox, Justus Thies, Angela Dai, Matthias Niessner
NeurIPS 2021

TODOs

Evaluation code and metrics (with ground truth data)
Model code (with pretrained checkpoint)
Test-time reconstruction code
Training (and evaluation) data preparation scripts

How to install the framework

Clone the repository with submodules:

git clone --recurse-submodules https://github.com/AljazBozic/TransformerFusion.git

Create Conda environment:

conda env create -f environment.yml

Compile local C++/CUDA dependencies:

conda activate tf
cd csrc
python setup.py install

Evaluate the reconstructions

We evaluate method performance on the test scenes of ScanNet dataset.

We compare scene reconstructions to the ground truth meshes, obtained with fusion of RGB-D data. Since the ground truth meshes are not complete, we additionally compute occlusion masks of RGB-D scans, to not penalize the reconstructions that are more complete than the ground truth meshes.

You can download both ground truth meshes and occlusion masks here. To evaluate the reconstructions, you need to place them into data/reconstructions, and extract the ground truth data to data/groundtruth. The reconstructions are expected to be named as ScanNet test scenes, e.g. scene0733_00.ply. The following script computes evaluation metrics over all provided scene meshes:

conda activate tf
python src/evaluation/eval.py

Citation

If you find our work useful in your research, please consider citing:

@article{
bozic2021transformerfusion,
title={TransformerFusion: Monocular RGB Scene Reconstruction using Transformers},
author={Bozic, Aljaz and Palafox, Pablo and Thies, Justus and Dai, Angela and Niessner, Matthias},
journal={Proc. Neural Information Processing Systems (NeurIPS)},
year={2021}}

Related work

Some other related work on monocular RGB reconstruction of indoor scenes:

License

The code from this repository is released under the MIT license.

Official implementation of NeurIPS'2021 paper TransformerFusion

Related tags

Overview

TransformerFusion: Monocular RGB Scene Reconstruction using Transformers

Project Page | Paper | Video

TODOs

How to install the framework

Evaluate the reconstructions

Citation

Related work

License

Owner

Aljaz Bozic

Official repository for the paper "Instance-Conditioned GAN"

This repository contains code accompanying the paper "An End-to-End Chinese Text Normalization Model based on Rule-Guided Flat-Lattice Transformer"

Transformers are Graph Neural Networks!

Evaluation suite for large-scale language models.

Face Library is an open source package for accurate and real-time face detection and recognition

Data, model training, and evaluation code for "PubTables-1M: Towards a universal dataset and metrics for training and evaluating table extraction models".

Official PyTorch implementation of the Fishr regularization for out-of-distribution generalization

Learn other languages using artificial intelligence with python.

A repository built on the Flow software package to explore cyber-security attacks on intelligent transportation systems.

(AAAI 2021) Progressive One-shot Human Parsing

Adversarial Attacks on Probabilistic Autoregressive Forecasting Models.

MDETR: Modulated Detection for End-to-End Multi-Modal Understanding

Learning an Adaptive Meta Model-Generator for Incrementally Updating Recommender Systems

Volumetric Correspondence Networks for Optical Flow, NeurIPS 2019.

This repository compare a selfie with images from identity documents and response if the selfie match.

学习 python3 以来写的一些垃圾玩具……

ReAct: Out-of-distribution Detection With Rectified Activations

Auto Seg-Loss: Searching Metric Surrogates for Semantic Segmentation

Stochastic Tensor Optimization for Robot Motion - A GPU Robot Motion Toolkit

PyTorch implementation of DeepDream algorithm

Official implementation of NeurIPS'2021 paper TransformerFusion

Related tags

Overview

TransformerFusion: Monocular RGB Scene Reconstruction using Transformers

Project Page | Paper | Video

TODOs

How to install the framework

Evaluate the reconstructions

Citation

Related work

License

Owner

Aljaz Bozic

Official repository for the paper "Instance-Conditioned GAN"

This repository contains code accompanying the paper "An End-to-End Chinese Text Normalization Model based on Rule-Guided Flat-Lattice Transformer"

Transformers are Graph Neural Networks!

Evaluation suite for large-scale language models.

Face Library is an open source package for accurate and real-time face detection and recognition

Data, model training, and evaluation code for "PubTables-1M: Towards a universal dataset and metrics for training and evaluating table extraction models".

Official PyTorch implementation of the Fishr regularization for out-of-distribution generalization

Learn other languages ​​using artificial intelligence with python.

A repository built on the Flow software package to explore cyber-security attacks on intelligent transportation systems.

(AAAI 2021) Progressive One-shot Human Parsing

Adversarial Attacks on Probabilistic Autoregressive Forecasting Models.

MDETR: Modulated Detection for End-to-End Multi-Modal Understanding

Learning an Adaptive Meta Model-Generator for Incrementally Updating Recommender Systems

Volumetric Correspondence Networks for Optical Flow, NeurIPS 2019.

This repository compare a selfie with images from identity documents and response if the selfie match.

学习 python3 以来写的一些垃圾玩具……

ReAct: Out-of-distribution Detection With Rectified Activations

Auto Seg-Loss: Searching Metric Surrogates for Semantic Segmentation

Stochastic Tensor Optimization for Robot Motion - A GPU Robot Motion Toolkit

PyTorch implementation of DeepDream algorithm

Learn other languages using artificial intelligence with python.