Lighthouse: Predicting Lighting Volumes for Spatially-Coherent Illumination

Pratul P. Srinivasan, Ben Mildenhall, Matthew Tancik, Jonathan T. Barron, Richard Tucker, Noah Snavely, CVPR 2020

This release contains code for predicting incident illumination at any 3D location within a scene. The algorithm takes a narrow-baseline stereo pair of RGB images as input, and predicts a multiscale RGBA lighting volume. Spatially-varying lighting within the volume can then be computed by standard volume rendering.

Running a pretrained model

interiornet_test.py contains an example script for running a pretrained model on the test set (formatted as .npz files). Please download and extract the pretrained model and testing examples files, and then include the corresponding file/directory names as command line flags when running interiornet_test.py.

Example usage (edit paths to match your directory structure): python -m lighthouse.interiornet_test --checkpoint_dir="lighthouse/model/" --data_dir="lighthouse/testset/" --output_dir="lighthouse/output/"

Training

Please refer to the train.py for code to use for training your own model.

This model was trained using the InteriorNet dataset. It may be helpful to read data_loader.py to get an idea of how we organized the InteriorNet dataset for training.

To train with the perceptual loss based on VGG features (as done in the paper), please download the imagenet-vgg-verydeep-19.mat pretrained VGG model, and include the corresponding path as a command line flag when running train.py.

Example usage (edit paths to match your directory structure): python -m lighthouse.train --vgg_model_file="lighthouse/model/imagenet-vgg-verydeep-19.mat" --load_dir="" --data_dir="lighthouse/data/InteriorNet/" --experiment_dir=lighthouse/training/

Extra

This model is quite memory-hungry, and we used a NVIDIA Tesla V100 GPU for training and testing with a single example per minibatch. You may run into memory constraints when training on a GPU with less than 16 GB memory or testing on a GPU with less than 12 GB memory. If you wish to train a model on a GPU with <16 GB memory, you may want to try removing the finest volume in the multiscale representation (see the model parameters in train.py).

If you find this code helpful, please cite our paper: @article{Srinivasan2020, author = {Pratul P. Srinivasan, Ben Mildenhall, Matthew Tancik, Jonathan T. Barron, Richard Tucker, Noah Snavely}, title = {Lighthouse: Predicting Lighting Volumes for Spatially-Coherent Illumination}, journal = {CVPR}, year = {2020}, }

Lighthouse: Predicting Lighting Volumes for Spatially-Coherent Illumination

Related tags

Overview

Lighthouse: Predicting Lighting Volumes for Spatially-Coherent Illumination

Pratul P. Srinivasan, Ben Mildenhall, Matthew Tancik, Jonathan T. Barron, Richard Tucker, Noah Snavely, CVPR 2020

Running a pretrained model

Training

Extra

Owner

Pratul Srinivasan

Instance-level Image Retrieval using Reranking Transformers

State of the art Semantic Sentence Embeddings

AWS documentation corpus for zero-shot open-book question answering.

This is the reference implementation for "Coresets via Bilevel Optimization for Continual Learning and Streaming"

Classifying audio using Wavelet transform and deep learning

This is an official implementation for "Self-Supervised Learning with Swin Transformers".

A PyTorch version of You Only Look at One-level Feature object detector

Deep Learning Interviews book: Hundreds of fully solved job interview questions from a wide range of key topics in AI.

A foreign language learning aid using a neural network to predict probability of translating foreign words

ChebLieNet, a spectral graph neural network turned equivariant by Riemannian geometry on Lie groups.

A Multi-modal Model Chinese Spell Checker Released on ACL2021.

Revisting Open World Object Detection

[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution

Bottom-up Human Pose Estimation

A curated list of awesome Model-Based RL resources

Tesla Light Show xLights Guide With python

Phonetic PosteriorGram (PPG)-Based Voice Conversion (VC)

Google Landmark Recogntion and Retrieval 2021 Solutions

Tracking code for the winner of track 1 in the MMP-Tracking Challenge at ICCV 2021 Workshop.

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation. (CVPR 2021)