MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

Last update: Dec 20, 2022

Related tags

Overview

MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

This is the official implementation for the ICCV 2021 paper Learning Signed Distance Field for Multi-view Surface Reconstruction

In this work, we introduce a novel neural surface reconstruction framework that leverages the knowledge of stereo matching and feature consistency to optimize the implicit surface representation. More specifically, we apply a signed distance field (SDF) and a surface light field to represent the scene geometry and appearance respectively. The SDF is directly supervised by geometry from stereo matching, and is refined by optimizing the multi-view feature consistency and the fidelity of rendered images. Our method is able to improve the robustness of geometry estimation and support reconstruction of complex scene topologies. Extensive experiments have been conducted on DTU, EPFL and Tanks and Temples datasets. Compared to previous state-of-the-art methods, our method achieves better mesh reconstruction in wide open scenes without masks as input.

How to Use

Environment Setup

The code is tested in the following environment (manually installed packages only). The newer version of the packages should also be fine.

dependencies:
  - cudatoolkit=10.2.89
  - numpy=1.19.2
  - python=3.8.8
  - pytorch=1.7.1
  - tqdm=4.60.0
  - pip:
    - cvxpy==1.1.12
    - gputil==1.4.0
    - imageio==2.9.0
    - open3d==0.13.0
    - opencv-python==4.5.1.48
    - pyhocon==0.3.57
    - scikit-image==0.18.3
    - scikit-learn==0.24.2
    - trimesh==3.9.13
    - pybind11==2.9.0

Data Preparation

Download preprocessed DTU datasets from here

Training

cd code
python training/exp_runner.py --data_dir <DATA_DIR>/scan<SCAN>/imfunc4 --batch_size 8 --nepoch 1800 --expname dtu_<SCAN>

The results will be written in exps/mvsdf_dtu_.

Trained Models

Download trained models and put them in exps folder. This set of models achieve the following results.

	Chamfer	PSNR
24	0.846	24.67
37	1.894	20.15
40	0.895	25.15
55	0.435	23.19
63	1.067	26.24
65	0.903	26.9
69	0.746	26.54
83	1.241	25.15
97	1.009	25.71
105	1.320	26.48
106	0.867	28.81
110	0.842	23.16
114	0.340	27.51
118	0.472	28.46
122	0.466	27.71
Mean	0.890	25.72

Testing

python evaluation/eval.py --data_dir <DATA_DIR>/scan<SCAN>/imfunc4 --expname dtu_<SCAN> [--eval_rendering]

add --eval_rendering flag to generate and evaluate rendered images. The results will be written in evals/mvsdf_dtu_.

Trimming

cd mesh_cut
python setup.py build_ext -i  # compile
python mesh_cut.py 
    
    
      [--thresh 15 --smooth 10]

Note that this part of code can only be used for research purpose. Please refer to mesh_cut/IBFS/license.txt

Evaluation

Apart from the official implementation, you can also use my re-implemented evaluation script.

Citation

If you find our work useful in your research, please kindly cite

@article{zhang2021learning,
	title={Learning Signed Distance Field for Multi-view Surface Reconstruction},
	author={Zhang, Jingyang and Yao, Yao and Quan, Long},
	journal={International Conference on Computer Vision (ICCV)},
	year={2021}
}

MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

Related tags

Overview

MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

How to Use

Environment Setup

Data Preparation

Training

Trained Models

Testing

Trimming

Evaluation

Citation

Owner

reimpliment of DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation

Train the HRNet model on ImageNet

This GitHub repository contains code used for plots in NeurIPS 2021 paper 'Stochastic Multi-Armed Bandits with Control Variates.'

PyMove is a Python library to simplify queries and visualization of trajectories and other spatial-temporal data

NumQMBasic - A mini-course offered to Undergrad physics students

Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

ExCon: Explanation-driven Supervised Contrastive Learning

Online Multi-Granularity Distillation for GAN Compression (ICCV2021)

A unified 3D Transformer Pipeline for visual synthesis

A PyTorch implementation of QANet.

Pre-Trained Image Processing Transformer (IPT)

Deep Learning Visuals contains 215 unique images divided in 23 categories

GLIP: Grounded Language-Image Pre-training

MRQy is a quality assurance and checking tool for quantitative assessment of magnetic resonance imaging (MRI) data.

Simple Tensorflow implementation of Toward Spatially Unbiased Generative Models (ICCV 2021)

A small library for creating and manipulating custom JAX Pytree classes

A Pytorch Implementation for Compact Bilinear Pooling.

Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks

A annotation of yolov5-5.0

A sketch extractor for anime/illustration.