This repo contains code to reproduce all experiments in Equivariant Neural Rendering

Last update: Nov 16, 2022

Overview

Equivariant Neural Rendering

This repo contains code to reproduce all experiments in Equivariant Neural Rendering by E. Dupont, M. A. Bautista, A. Colburn, A. Sankar, C. Guestrin, J. Susskind, Q. Shan, ICML 2020.

Pre-trained models

The weights for the trained chairs model are provided in trained-models/chairs.pt.

The other pre-trained models are located https://icml20-prod.cdn-apple.com/eqn-data/models/pre-trained_models.zip. They should be downloaded and placed into the trained-models directory. A small model chairs.pt is included in the git repo.

Examples

Requirements

The requirements can be directly installed from PyPi with pip install -r requirements.txt. Running the code requires python3.6 or higher.

Datasets

ShapeNet chairs: https://icml20-prod.cdn-apple.com/eqn-data/data/chairs.zip
ShapeNet cars: https://icml20-prod.cdn-apple.com/eqn-data/data/cars.zip
MugsHQ: https://icml20-prod.cdn-apple.com/eqn-data/data/mugs.zip
3D mountains: https://icml20-prod.cdn-apple.com/eqn-data/data/mountains.zip

each zip file will expand into 3 separate components and a readme e.g:

cars-train.zip
cars-val.zip
cars-test.zip
readme.txt containing the license terms.

A few example images are provided in imgs/example-data/.

The chairs and car datasets were created with the help of Vincent Sitzmann.

We thank Bernhard Vogl ([email protected]) for the lightmaps. The MugsHQ were rendered utilizing an environmental map located at http://dativ.at/lightprobes.

Usage

Training a model

To train a model, run the following:

python experiments.py config.json

This supports both single and multi-GPU training (see config.json for detailed training options). Note that you need to download the datasets before running this command.

Quantitative evaluation

To evaluate a model, run the following:

python evaluate_psnr.py

This will measure the performance (in PSNR) of a trained model on a test dataset.

Model exploration and visualization

The jupyter notebook exploration.ipynb shows how to use a trained model to infer a scene representation from a single image and how to use this representation to render novel views.

Coordinate system

The diagram below details the coordinate system we use for the voxel grid. Due to the manner in which images are stored in arrays and the way PyTorch's affine_grid and grid_sample functions work, this is a slightly unusual coordinate system. Note that theta and phi correspond to elevation and azimuth rotations of the camera around the scene representation. Note also that these are left handed rotations. Full details of the voxel rotation function can be found in transforms3d/rotations.py.

Citing

If you find this code useful in your research, consider citing with

@article{dupont2020equivariant,
  title={Equivariant Neural Rendering},
  author={Dupont, Emilien and Miguel Angel, Bautista and Colburn, Alex and Sankar, Aditya and Guestrin, Carlos and Susskind, Josh and Shan, Qi},
  journal={arXiv preprint arXiv:2006.07630},
  year={2020}
}

License

This project is licensed under the Apple Sample Code License

This repo contains code to reproduce all experiments in Equivariant Neural Rendering

Related tags

Overview

Equivariant Neural Rendering

Pre-trained models

Examples

Requirements

Datasets

Usage

Training a model

Quantitative evaluation

Model exploration and visualization

Coordinate system

Citing

License

Owner

Apple

Predicting Semantic Map Representations from Images with Pyramid Occupancy Networks

Creating Multi Task Models With Keras

This repository contains all the code and materials distributed in the 2021 Q-Programming Summer of Qode.

Blender Add-on that sets a Material's Base Color to one of Pantone's Colors of the Year

Representing Long-Range Context for Graph Neural Networks with Global Attention

[WACV 2022] Contextual Gradient Scaling for Few-Shot Learning

Official repository for Jia, Raghunathan, Göksel, and Liang, "Certified Robustness to Adversarial Word Substitutions" (EMNLP 2019)

This repository contains the reference implementation for our proposed Convolutional CRFs.

Referring Video Object Segmentation

Code for CVPR2021 "Visualizing Adapted Knowledge in Domain Transfer". Visualization for domain adaptation. #explainable-ai

Official source code to CVPR'20 paper, "When2com: Multi-Agent Perception via Communication Graph Grouping"

Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian (CVPR 2022)

Implementation of ConvMixer in TensorFlow and Keras

Generative Adversarial Networks for High Energy Physics extended to a multi-layer calorimeter simulation

This project aims at providing a concise, easy-to-use, modifiable reference implementation for semantic segmentation models using PyTorch.

FreeSOLO for unsupervised instance segmentation, CVPR 2022

NAVER BoostCamp Final Project

Real-CUGAN - Real Cascade U-Nets for Anime Image Super Resolution

ML-based medical imaging using Azure

Companion repository to the paper accepted at the 4th ACM SIGSPATIAL International Workshop on Advances in Resilient and Intelligent Cities