PyTorch implementation for MINE: Continuous-Depth MPI with Neural Radiance Fields

Last update: Dec 29, 2022

Overview

MINE: Continuous-Depth MPI with Neural Radiance Fields

Project Page | Video

PyTorch implementation for our ICCV 2021 paper.

MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis
Jiaxin Li*¹, Zijian Feng*¹, Qi She¹, Henghui Ding¹, Changhu Wang¹, Gim Hee Lee²
¹ByteDance, ²National University of Singapore
*denotes equal contribution

Our MINE takes a single image as input and densely reconstructs the frustum of the camera, from which novel views of the given scene can be rendered.
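To make the rendering step concrete, here is a minimal sketch of the standard NeRF-style volume rendering rule applied to one camera ray that crosses a stack of depth planes. It is not taken from this repository: the function name composite_ray, the argument layout, and the choice to treat the last plane as very thick are all assumptions made for illustration.

```python
import math


def composite_ray(colors, sigmas, depths):
    """Blend per-plane samples along one camera ray, front to back.

    colors: list of (r, g, b) tuples, one per plane, nearest plane first
    sigmas: list of non-negative volume densities, one per plane
    depths: list of strictly increasing plane depths
    Returns the blended (r, g, b) colour and the expected depth.
    """
    n = len(depths)
    transmittance = 1.0  # fraction of light that has not been absorbed yet
    rgb = [0.0, 0.0, 0.0]
    expected_depth = 0.0
    for i in range(n):
        # Distance to the next plane; the last plane is treated as very thick
        # (an assumption borrowed from common NeRF implementations).
        delta = depths[i + 1] - depths[i] if i + 1 < n else 1e10
        alpha = 1.0 - math.exp(-sigmas[i] * delta)
        weight = transmittance * alpha
        for c in range(3):
            rgb[c] += weight * colors[i][c]
        expected_depth += weight * depths[i]
        transmittance *= 1.0 - alpha
    return tuple(rgb), expected_depth


# Two planes: an empty near plane and an opaque green far plane.
color, depth = composite_ray(
    colors=[(1.0, 0.0, 0.0), (0.0, 1.0, 0.0)],
    sigmas=[0.0, 5.0],
    depths=[1.0, 2.0],
)
print(color, depth)
```

In a full model the same blend would run for every pixel of the target view, typically as batched tensor operations rather than Python loops.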

[Figure: the overall architecture of our method]

Run training on the LLFF dataset:

First, set up your conda environment:

conda env create -f environment.yml 
conda activate MINE

Download the pre-downsampled version of the LLFF dataset from Google Drive, unzip it and put it in the root of the project, then start training by running the following command:

sh start_training.sh MASTER_ADDR="localhost" MASTER_PORT=1234 N_NODES=1 GPUS_PER_NODE=2 NODE_RANK=0 WORKSPACE=/run/user/3861/vs_tmp DATASET=llff VERSION=debug EXTRA_CONFIG='{"training.gpus": "0,1"}'
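The EXTRA_CONFIG argument above is a JSON object whose keys use dots, such as "training.gpus". One plausible reading is that each dotted key overrides a nested entry of the training configuration. The sketch below illustrates that idea only; apply_overrides, the base dictionary and the "epochs" key are hypothetical and are not the repository's actual loading code.

```python
import json


def apply_overrides(config, extra_config_json):
    """Return a copy of `config` with dotted-key overrides applied."""
    # A JSON round trip is a cheap deep copy for JSON-like data.
    merged = json.loads(json.dumps(config))
    for dotted_key, value in json.loads(extra_config_json).items():
        node = merged
        *parents, leaf = dotted_key.split(".")
        for name in parents:
            # Walk down the nesting, creating missing levels on the way.
            node = node.setdefault(name, {})
        node[leaf] = value
    return merged


# Hypothetical defaults; only "training.gpus" appears in the command above.
base = {"training": {"gpus": "0", "epochs": 10}}
print(apply_overrides(base, '{"training.gpus": "0,1"}'))
```

The original dictionary is left untouched, so the same defaults can be reused for several runs with different overrides.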

The TensorBoard logs and checkpoints are written to a sub-directory of the workspace (WORKSPACE + VERSION).

Apart from the LLFF dataset, we experimented on the RealEstate10K, KITTI Raw and Flowers Light Fields datasets. The data pre-processing code and training flow for these datasets will be released later.

Running our pretrained models:

We release the pretrained models trained on the RealEstate10K, KITTI and the Flowers datasets:

Dataset	N (number of planes)	Input Resolution (W x H)	Download Link
RealEstate10K	32	384x256	Google Drive
RealEstate10K	64	384x256	Google Drive
KITTI	32	768x256	Google Drive
KITTI	64	768x256	Google Drive
Flowers	32	512x384	Google Drive
Flowers	64	512x384	Google Drive

To run the models, download the checkpoint and the hyper-parameter yaml file and place them in the same directory, then run the following script:

python3 visualizations/image_to_video.py --checkpoint_path MINE_realestate10k_384x256_monodepth2_N64/checkpoint.pth --gpus 0 --data_path visualizations/home.jpg --output_dir .
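Because the script expects the hyper-parameter yaml file to sit next to the checkpoint, a small pre-flight check can catch a misplaced download before the model is loaded. The helper below is a hypothetical sketch: find_model_files is not part of the repository, and since the yaml file's name is not stated above, it simply accepts any .yaml or .yml file in the checkpoint's directory.

```python
from pathlib import Path


def find_model_files(checkpoint_path):
    """Locate the checkpoint and a YAML file stored in the same directory."""
    checkpoint = Path(checkpoint_path)
    if not checkpoint.is_file():
        raise FileNotFoundError(f"checkpoint not found: {checkpoint}")
    # Assumption: any YAML file beside the checkpoint is the parameter file.
    yaml_files = sorted(
        p for p in checkpoint.parent.iterdir()
        if p.suffix in (".yaml", ".yml")
    )
    if not yaml_files:
        raise FileNotFoundError(f"no YAML file next to {checkpoint}")
    return checkpoint, yaml_files[0]
```

For example, calling find_model_files("MINE_realestate10k_384x256_monodepth2_N64/checkpoint.pth") raises FileNotFoundError if either file is missing and otherwise returns both paths.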

Citation

If you find our work helpful to your research, please cite our paper:

@inproceedings{mine2021,
  title={MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis},
  author={Jiaxin Li and Zijian Feng and Qi She and Henghui Ding and Changhu Wang and Gim Hee Lee},
  year={2021},
  booktitle={ICCV},
}


Owner

Zijian Feng

