[ICCV 2021 Oral] NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo

Last update: Dec 24, 2022

Overview

NerfingMVS

Project Page | Paper | Video | Data

NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo
Yi Wei, Shaohui Liu, Yongming Rao, Wang Zhao, Jiwen Lu, Jie Zhou
ICCV 2021 (Oral Presentation)

Installation

Pull NerfingMVS repo.

git clone --recursive [email protected]:weiyithu/NerfingMVS.git

Install python packages with anaconda.

conda create -n NerfingMVS python=3.7
conda activate NerfingMVS
conda install pytorch==1.7.1 torchvision==0.8.2 torchaudio==0.7.2 -c pytorch
pip install -r requirements.txt

We use COLMAP to calculate poses and sparse depths. However, original COLMAP does not have fusion mask for each view. Thus, we add masks to COLMAP and denote it as a submodule. Please follow https://colmap.github.io/install.html to install COLMAP in ./colmap folder.

Usage

Download 8 ScanNet scene data used in the paper here and put them under ./data folder. We also upload final results and checkpoints of each scene here.
Run NerfingMVS
```
sh run.sh $scene_name
```
The whole procedure takes about 3.5 hours on one NVIDIA GeForce RTX 2080 GPU, including COLMAP, depth priors training, NeRF training, filtering and evaluation. COLMAP can be accelerated with multiple GPUs.You will get per-view depth maps in ./logs/$scene_name/filter. Note that these depth maps have been aligned with COLMAP poses. COLMAP results will be saved in ./data/$scene_name while others will be preserved in ./logs/$scene_name

Run on Your Own Data!

Place your data with the following structure:
```
NerfingMVS
|───data
|    |──────$scene_name
|    |   |   train.txt
|    |   |──────images
|    |   |    |    001.jpg
|    |   |    |    002.jpg
|    |   |    |    ...
|───configs
|    $scene_name.txt
|     ...
```
train.txt contains names of all the images. Images can be renamed arbitrarily and '001.jpg' is just an example. You also need to imitate ScanNet scenes to create a config file in ./configs. Note that factor parameter controls the resolution of output depth maps. You also should adjust depth_N_iters, depth_H, depth_W in options.py accordingly.
Run NerfingMVS without evaluation
```
sh demo.sh $scene_name
```
Since our work currently relies on COLMAP, the results are dependent on the quality of the acquired poses and sparse reconstruction from COLMAP.

Acknowledgement

Our code is based on the pytorch implementation of NeRF: NeRF-pytorch. We also refer to mannequin challenge.

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{wei2021nerfingmvs,
  author    = {Wei, Yi and Liu, Shaohui and Rao, Yongming and Zhao, Wang and Lu, Jiwen and Zhou, Jie},
  title     = {NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo},
  booktitle = {ICCV},
  year = {2021}
}

[ICCV 2021 Oral] NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo

Related tags

Overview

NerfingMVS

Project Page | Paper | Video | Data

Installation

Usage

Run on Your Own Data!

Acknowledgement

Citation

Owner

Yi Wei

Laser device for neutralizing - mosquitoes, weeds and pests

DeepSTD: Mining Spatio-temporal Disturbances of Multiple Context Factors for Citywide Traffic Flow Prediction

This is the official repository for our paper: ''Pruning Self-attentions into Convolutional Layers in Single Path''.

Cross View SLAM

Sequence lineage information extracted from RKI sequence data repo

A tool to visualise the results of AlphaFold2 and inspect the quality of structural predictions

Contrastive Language-Image Pretraining

PCGNN - Procedural Content Generation with NEAT and Novelty

TVNet: Temporal Voting Network for Action Localization

CTRMs: Learning to Construct Cooperative Timed Roadmaps for Multi-agent Path Planning in Continuous Spaces

An open-source online reverse dictionary.

Multi-Task Temporal Shift Attention Networks for On-Device Contactless Vitals Measurement (NeurIPS 2020)

Official Pytorch implementation of "Learning to Estimate Robust 3D Human Mesh from In-the-Wild Crowded Scenes", CVPR 2022

Face Detection and Alignment using Multi-task Cascaded Convolutional Networks (MTCNN)

an implementation of softmax splatting for differentiable forward warping using PyTorch

Python calculations for the position of the sun and moon.

Self-supervised learning algorithms provide a way to train Deep Neural Networks in an unsupervised way using contrastive losses

Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"

ObjectDetNet is an easy, flexible, open-source object detection framework

A library built upon PyTorch for building embeddings on discrete event sequences using self-supervision