Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Last update: Oct 12, 2022

Related tags

Deep Learning deep-3dmask

Overview

Deep 3D Mask Volume for View Synthesis of Dynamic Scenes

Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Kai-En Lin¹, Lei Xiao², Feng Liu², Guowei Yang¹, Ravi Ramamoorthi¹

¹University of California, San Diego, ²Facebook Reality Labs

Requirements

Install required packages

Make sure you have up-to-date NVIDIA drivers supporting CUDA 11.1 (10.2 could work but need to change cudatoolkit package accordingly)

Run

conda env create -f environment.yml
conda activate video_viewsynth

Usage

Rendering

Download our pretrained checkpoint and testing data. Extract the content to [path_to_data_directory]. It contains frames and background folders, as well as poses_bounds.npy.
In configs, setup data path by changing render_video.txt

root_dir should point to the frames folder mentioned in 1. and bg_dir should point to background folder.

out_dir can be your desired output folder.

ckpt_path should be the pretrained checkpoint path.
Run python render_llff_video.py --config [config_file_path]

e.g. python render_llff_video.py --config ../configs/render_video.txt

(Optional) For your own data, please run prepare_data.sh

sh render.sh [frame_folder] [starting_frame] [ending_frame] [output_folder_name]

Make sure your data is in this structure before running
```
[frame_folder] --- cam00 --- 00000.jpg
                |         |- 00001.jpg
                |         ...
                |- cam01
                |- cam02
                ...
                |- poses_bounds.npy
```
e.g. sh render.sh ~/deep_3d_data/frames 0 20 qual

Training

Train MPI

Download RealEstate10K dataset and extract the frames. There are scripts in preprocessing folder which can be used to generate the data.

The order should be download_data.py -> extract_frames.py -> compress_data.py.

Remember to change the path in compress_data.py.
Change the paths in config file train_realestate10k.txt

Run

cd train_mpi
python train.py --config ../configs/train_realestate10k.txt

Train Mask

Once MPI is trained, we can use the checkpoint to train 3D mask network.

Download dataset
Change the paths in config file train_mask.txt

Run

cd train_mask
python train.py --config ../configs/train_mask.txt

Citation

@inproceedings {lin2021deep,
    title = {Deep 3D Mask Volume for View Synthesis of Dynamic Scenes},
    author = {Kai-En Lin and Lei Xiao and Feng Liu and Guowei Yang and Ravi Ramamoorthi},
    booktitle = {ICCV},
    year = {2021},
}

Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Related tags

Overview

Deep 3D Mask Volume for View Synthesis of Dynamic Scenes

Requirements

Install required packages

Usage

Rendering

Training

Train MPI

Train Mask

Citation

Owner

Ken Lin

PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning

Tools for investing in Python

PaRT: Parallel Learning for Robust and Transparent AI

A universal memory dumper using Frida

Real-time LIDAR-based Urban Road and Sidewalk detection for Autonomous Vehicles 🚗

The FIRST GANs-based omics-to-omics translation framework

YOLO-v5 기반 단안 카메라의 영상을 활용해 차간 거리를 일정하게 유지하며 주행하는 Adaptive Cruise Control 기능 구현

PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)

A Distributional Approach To Controlled Text Generation

PyTorch implementation of the paper: Label Noise Transition Matrix Estimation for Tasks with Lower-Quality Features

Training neural models with structured signals.

Reinforcement learning framework and algorithms implemented in PyTorch.

Official PyTorch implementation of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image", ICCV 2019

Deep Distributed Control of Port-Hamiltonian Systems

This repository contains the DendroMap implementation for scalable and interactive exploration of image datasets in machine learning.

PyTorch/GPU re-implementation of the paper Masked Autoencoders Are Scalable Vision Learners

Pytorch implementation of MalConv

Simple Dynamic Batching Inference

CoReD: Generalizing Fake Media Detection with Continual Representation using Distillation (ACMMM'21 Oral Paper)

NFT-Price-Prediction-CNN - Using visual feature extraction, prices of NFTs are predicted via CNN (Alexnet and Resnet) architectures.