[ICCV21] Code for RetrievalFuse: Neural 3D Scene Reconstruction with a Database

Last update: Dec 22, 2022

Overview

RetrievalFuse

Paper | Project Page | Video

RetrievalFuse: Neural 3D Scene Reconstruction with a Database
Yawar Siddiqui, Justus Thies, Fangchang Ma, Qi Shan, Matthias Nießner, Angela Dai
ICCV 2021

This repository contains the code for the ICCV 2021 paper RetrievalFuse, a novel approach for 3D reconstruction from low-resolution distance field grids and from point clouds.

In contrast to traditional learned generative models, which encode the full generative process into a neural network and can struggle to maintain local details at the scene level, we introduce a new method that directly leverages scene geometry from the training database.

Files and Folders

The broad code structure is as follows:

| File / Folder | Description |
|---------------|-------------|
| `config/super_resolution` | Super-resolution experiment configs |
| `config/surface_reconstruction` | Surface reconstruction experiment configs |
| `config/base` | Defaults for configurations |
| `config/config_handler.py` | Config file parser |
| `data/splits` | Training and validation splits for different datasets |
| `dataset/scene.py` | SceneHandler class for managing access to scene data samples |
| `dataset/patched_scene_dataset.py` | Pytorch dataset class for scene data |
| `external/ChamferDistancePytorch` | For calculating rough chamfer distance between prediction and target while training |
| `model/attention.py` | Attention, folding and unfolding modules |
| `model/loss.py` | Loss functions |
| `model/refinement.py` | Refinement network |
| `model/retrieval.py` | Retrieval network |
| `model/unet.py` | U-Net model used as a backbone in refinement network |
| `runs/` | Checkpoint and visualizations for experiments dumped here |
| `trainer/train_retrieval.py` | Lightning module for training retrieval network |
| `trainer/train_refinement.py` | Lightning module for training refinement network |
| `util/arguments.py` | Argument parsing (additional arguments apart from those in config) |
| `util/filesystem_logger.py` | For copying source code for each run in the experiment log directory |
| `util/metrics.py` | Rough metrics for logging during training |
| `util/mesh_metrics.py` | Final metrics on meshes |
| `util/retrieval.py` | Script to dump retrievals once retrieval networks have been trained; needed for training refinement. |
| `util/visualizations.py` | Utility scripts for visualizations |

Further, the `data/` directory has the following layout:

data                    # root data directory
├── sdf_008             # low-res (8^3) distance fields
    ├── ...
├── sdf_016             # low-res (16^3) distance fields
    ├── ...
├── sdf_064             # high-res (64^3) distance fields
    ├── ...
├── pc_20K              # point cloud inputs
    ├── ...
├── splits              # train/val splits
├── size                # data needed by SceneHandler class (autocreated on first run)
├── occupancy           # data needed by SceneHandler class (autocreated on first run)
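Before launching a run, it can help to confirm that the top-level directories from this layout are in place. The following is a minimal sketch using only the standard library; the helper names `check_data_layout` and `missing_inputs` are hypothetical and not part of this repository, and treating every input directory as expected is an assumption, since a given experiment may need only some of them.

```python
from pathlib import Path

# Top-level sub-directories named in the layout. Which of them a particular
# experiment actually needs is an assumption left to the user.
EXPECTED_DIRS = ("sdf_008", "sdf_016", "sdf_064", "pc_20K", "splits")
# Per the layout, these are created automatically on the first run.
AUTOCREATED_DIRS = ("size", "occupancy")


def check_data_layout(root):
    """Map each known sub-directory name to True if it exists under root."""
    root = Path(root)
    names = EXPECTED_DIRS + AUTOCREATED_DIRS
    return {name: (root / name).is_dir() for name in names}


def missing_inputs(root):
    """List the expected (non-autocreated) sub-directories that are absent."""
    status = check_data_layout(root)
    return [name for name in EXPECTED_DIRS if not status[name]]


if __name__ == "__main__":
    for name, present in check_data_layout("data").items():
        print(f"{name:10s} {'ok' if present else 'missing'}")
```

A missing `size` or `occupancy` directory is not an error on a fresh checkout, which is why the sketch reports them separately from the inputs.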

Dependencies

Install the dependencies using pip

```bash
pip install -r requirements.txt
```

Be sure that you pull the `ChamferDistancePytorch` submodule in `external`.

Data Preparation

For ShapeNetV2 and Matterport, get the appropriate meshes from the datasets. For 3DFRONT, get the 3DFUTURE meshes and the 3DFRONT scripts, then use our fork of 3D-FRONT-ToolBox to create the room meshes.

Once you have the meshes, use our fork of sdf-gen to create the low-res distance field inputs and the high-res targets. To create point cloud inputs, simply use `trimesh.sample.sample_surface` (see `util/misc/sample_scene_point_clouds`). Place the processed data in the appropriate directories:

data/sdf_008/ or data/sdf_016/ for low-res inputs
data/pc_20K/ for point cloud inputs
data/sdf_064/ for targets
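For readers unfamiliar with surface sampling, the sketch below illustrates the general idea behind drawing point cloud inputs from a mesh: pick triangles with probability proportional to their area, then draw a uniform point inside each picked triangle. It is a standard-library illustration only, not trimesh's implementation and not the script from this repository; the function name `sample_surface_points` and the plain-tuple mesh representation are hypothetical. In practice, use `trimesh.sample.sample_surface` as described above.

```python
import random


def triangle_area(a, b, c):
    """Area of a 3D triangle: half the norm of the cross product of two edges."""
    ux, uy, uz = b[0] - a[0], b[1] - a[1], b[2] - a[2]
    vx, vy, vz = c[0] - a[0], c[1] - a[1], c[2] - a[2]
    cx, cy, cz = uy * vz - uz * vy, uz * vx - ux * vz, ux * vy - uy * vx
    return 0.5 * (cx * cx + cy * cy + cz * cz) ** 0.5


def sample_surface_points(vertices, faces, count, seed=None):
    """Draw `count` points from a triangle mesh, weighted by triangle area."""
    rng = random.Random(seed)
    triangles = [tuple(vertices[i] for i in face) for face in faces]
    areas = [triangle_area(*tri) for tri in triangles]
    # Larger triangles receive proportionally more samples.
    chosen = rng.choices(triangles, weights=areas, k=count)
    points = []
    for a, b, c in chosen:
        r1, r2 = rng.random(), rng.random()
        # Reflect samples that fall outside the triangle back inside it.
        if r1 + r2 > 1.0:
            r1, r2 = 1.0 - r1, 1.0 - r2
        w = 1.0 - r1 - r2
        points.append(tuple(w * a[k] + r1 * b[k] + r2 * c[k] for k in range(3)))
    return points


if __name__ == "__main__":
    # A unit square in the z = 0 plane, split into two triangles.
    verts = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0), (0.0, 1.0, 0.0)]
    faces = [(0, 1, 2), (0, 2, 3)]
    pts = sample_surface_points(verts, faces, count=20000, seed=0)
    print(len(pts), pts[0])
```

The sample count of 20000 in the usage example is a guess based on the `pc_20K` directory name, not a value confirmed by the repository.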

Training the Retrieval Network

To train retrieval networks use the following command:

python trainer/train_retrieval.py --config config/<config> --val_check_interval 5 --experiment retrieval --wandb_main --sanity_steps 1

We provide some sample configurations for retrieval.

For super-resolution, e.g.

config/super_resolution/ShapeNetV2/retrieval_008_064.yaml
config/super_resolution/3DFront/retrieval_008_064.yaml
config/super_resolution/Matterport3D/retrieval_016_064.yaml

For surface-reconstruction, e.g.

config/surface_reconstruction/ShapeNetV2/retrieval_128_064.yaml
config/surface_reconstruction/3DFront/retrieval_128_064.yaml
config/surface_reconstruction/Matterport3D/retrieval_128_064.yaml

Once trained, create the retrievals for train/validation set using the following commands:

python util/retrieval.py --mode map --retrieval_ckpt <trained_retrieval_ckpt> --config <retrieval_config>

python util/retrieval.py --mode compose --retrieval_ckpt <trained_retrieval_ckpt> --config <retrieval_config>
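The train, map, and compose steps above can be scripted from a small driver. The sketch below only assembles the argument lists, using exactly the flags shown in the commands above; the helper names `train_retrieval_command` and `dump_retrieval_commands` are hypothetical and not part of this repository, and the checkpoint and config values are passed through verbatim because their actual paths depend on your run.

```python
import shlex
import sys


def train_retrieval_command(config, python=sys.executable):
    """Argument list for training a retrieval network with the given config."""
    return [
        python, "trainer/train_retrieval.py",
        "--config", config,
        "--val_check_interval", "5",
        "--experiment", "retrieval",
        "--wandb_main",
        "--sanity_steps", "1",
    ]


def dump_retrieval_commands(retrieval_ckpt, retrieval_config, python=sys.executable):
    """Argument lists for the 'map' and then the 'compose' retrieval dump steps."""
    return [
        [python, "util/retrieval.py", "--mode", mode,
         "--retrieval_ckpt", retrieval_ckpt, "--config", retrieval_config]
        for mode in ("map", "compose")
    ]


if __name__ == "__main__":
    # Placeholders are kept as-is; substitute your own paths before running.
    commands = [train_retrieval_command("config/<config>")]
    commands += dump_retrieval_commands("<trained_retrieval_ckpt>", "<retrieval_config>")
    for cmd in commands:
        # Print only; pass each list to subprocess.run(cmd, check=True) to execute.
        print(shlex.join(cmd))
```

Building argument lists rather than shell strings avoids quoting problems when a checkpoint path contains spaces. The checkpoint is only known after training finishes, so the dump commands are built in a separate step.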

Training the Refinement Network

Use the following command to train the refinement network:

python trainer/train_refinement.py --config <config> --val_check_interval 5 --experiment refinement --sanity_steps 1 --wandb_main --retrieval_ckpt <retrieval_ckpt>

Again, sample configurations for refinement are provided in the config directory.

For super-resolution, e.g.

config/super_resolution/ShapeNetV2/refinement_008_064.yaml
config/super_resolution/3DFront/refinement_008_064.yaml
config/super_resolution/Matterport3D/refinement_016_064.yaml

For surface-reconstruction, e.g.

config/surface_reconstruction/ShapeNetV2/refinement_128_064.yaml
config/surface_reconstruction/3DFront/refinement_128_064.yaml
config/surface_reconstruction/Matterport3D/refinement_128_064.yaml

Visualizations and Logs

Visualizations and checkpoints are written to the `runs/` directory. Logs are uploaded to the user's [Weights & Biases](https://wandb.ai/site) dashboard.

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{siddiqui2021retrievalfuse,
  title = {RetrievalFuse: Neural 3D Scene Reconstruction with a Database},
  author = {Siddiqui, Yawar and Thies, Justus and Ma, Fangchang and Shan, Qi and Nie{\ss}ner, Matthias and Dai, Angela},
  booktitle = {Proc. International Conference on Computer Vision (ICCV)},
  month = oct,
  year = {2021},
  doi = {},
  month_numeric = {10}
}

License

The code from this repository is released under the MIT license.
