3DMV jointly combines RGB color and geometric information to perform 3D semantic segmentation of RGB-D scans.

Last update: Feb 06, 2022

Overview

3DMV

3DMV jointly combines RGB color and geometric information to perform 3D semantic segmentation of RGB-D scans. This work is based on our ECCV'18 paper, 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation.

Code

Installation:

Training is implemented with PyTorch. This code was developed under PyTorch 0.2 and recently upgraded to PyTorch 0.4.

Training:

Run python train.py --help to list all training options. Example training call:

python train.py --gpu 0 --train_data_list [path to list of train files] --data_path_2d [path to 2d image data] --class_weight_file [path to txt file of train histogram] --num_nearest_images 5 --model2d_path [path to pretrained 2d model]

Trained models: models.zip

Testing:

Run python test.py --help to list all test options. Example test call:

python test.py --gpu 0 --scene_list [path to list of test scenes] --model_path [path to trained model.pth] --data_path_2d [path to 2d image data] --data_path_3d [path to test scene data] --num_nearest_images 5 --model2d_orig_path [path to pretrained 2d model]
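When the test call is launched from a script, it can help to assemble the argument list programmatically. The following is a minimal sketch, not part of the 3DMV code: the helper build_command is hypothetical, and every path in the example is a placeholder to be replaced with your own locations. Only the option names are taken from the example call above.

```python
import shlex
import sys


def build_command(script, options):
    """Turn a dict of option names and values into an argument list.

    Hypothetical helper for illustration; it only formats "--name value" pairs.
    """
    args = [sys.executable, script]
    for name, value in options.items():
        args += [f"--{name}", str(value)]
    return args


# Placeholder paths (assumptions): replace them with your own files.
test_cmd = build_command("test.py", {
    "gpu": 0,
    "scene_list": "data/test_scenes.txt",
    "model_path": "models/model.pth",
    "data_path_2d": "data/2d",
    "data_path_3d": "data/3d",
    "num_nearest_images": 5,
    "model2d_orig_path": "models/model2d.pth",
})

# Print the command in a form that can be pasted into a shell.
print(" ".join(shlex.quote(a) for a in test_cmd))
# To actually launch it: subprocess.run(test_cmd, check=True)
```

The same helper works for the training call by passing "train.py" and the training options instead.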

Data:

This data has been precomputed from the ScanNet (v2) dataset.

Train data for ScanNet v2: 3dmv_scannet_v2_train.zip (6.2 GB)

2D training images can be extracted from the ScanNet dataset using the 2D data preparation script in prepare_data.
Expected file structure for 2D data:

scene0000_00/
|--color/
   |--[framenum].jpg
       ⋮
|--depth/
   |--[framenum].png   (16-bit pngs)
       ⋮
|--pose/
   |--[framenum].txt   (4x4 rigid transform as txt file)
       ⋮
|--label/    (if applicable)
   |--[framenum].png   (8-bit pngs)
       ⋮
scene0000_01/
⋮
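Before training, it may be worth checking that a scene folder matches the layout above. The sketch below is not part of the 3DMV code; the function names check_scene and load_pose are hypothetical. It assumes that each pose file holds four lines of four whitespace-separated numbers, and that every color frame should have a depth image and a pose with the same frame number. The optional label/ folder is not checked.

```python
from pathlib import Path

# Sub-folder name -> expected file extension, as listed in the layout above.
REQUIRED = {"color": ".jpg", "depth": ".png", "pose": ".txt"}


def load_pose(path):
    """Parse a pose file, assuming 4 lines of 4 whitespace-separated floats."""
    rows = [[float(v) for v in line.split()]
            for line in Path(path).read_text().splitlines() if line.strip()]
    if len(rows) != 4 or any(len(r) != 4 for r in rows):
        raise ValueError(f"{path}: expected a 4x4 matrix")
    return rows


def check_scene(scene_dir):
    """Return a list of human-readable problems found in one scene folder."""
    scene_dir = Path(scene_dir)
    problems = []
    frames = {}
    for name, ext in REQUIRED.items():
        folder = scene_dir / name
        if not folder.is_dir():
            problems.append(f"missing folder: {name}/")
            continue
        # Collect frame numbers (file names without extension).
        frames[name] = {p.stem for p in folder.glob(f"*{ext}")}
    # Assumption: every color frame needs a matching depth image and pose.
    if len(frames) == len(REQUIRED):
        for name in ("depth", "pose"):
            missing = sorted(frames["color"] - frames[name])
            if missing:
                problems.append(f"{name}/ lacks frames: {missing}")
    return problems
```

Calling check_scene("scene0000_00") returns an empty list when all three folders exist and the frame numbers line up; otherwise each entry describes one problem.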

Test scenes for ScanNet v2: 3dmv_scannet_v2_test_scenes.zip (110 MB)

Citation:

If you find our work useful in your research, please consider citing:

@inproceedings{dai20183dmv,
 author = {Dai, Angela and Nie{\ss}ner, Matthias},
 booktitle = {Proceedings of the European Conference on Computer Vision ({ECCV})},
 title = {3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation},
 year = {2018}
}

Contact:

If you have any questions, please email Angela Dai at [email protected].


Owner

Владислав Молодцов
