[ICCV 2021 (oral)] Planar Surface Reconstruction from Sparse Views

Last update: Jan 05, 2023

Related tags

Overview

Planar Surface Reconstruction From Sparse Views

Linyi Jin, Shengyi Qian, Andrew Owens, David F. Fouhey
University of Michigan
ICCV 2021 (Oral)

This repo contains code for our paper. Our model is implemented in Detectron2.

Given two RGB images with an unknown relationship, our system produces a single, coherent planar surface reconstruction of the scene in terms of 3D planes and relative camera poses.

We use a ResNet50-FPN to detect planes and predict probabilities of relative camera poses, and use a two-step optimization to generate a coherent planar reconstruction. (a) For each plane, we predict a segmentation mask, plane parameters, and an appearance feature. (b) Concurrently, we pass image features from the detection backbone through the attention layer and predict the camera transformation between views. (c) Our discrete optimization fuses the prediction of the separate heads to select the best camera pose and plane correspondence. (d) Finally, we use continuous optimization to update the camera and plane parameters.

Usage Instructions

Citation

If you find this code useful, please consider citing:

@inproceedings{jin2021planar,
      title={Planar Surface Reconstruction from Sparse Views}, 
      author={Linyi Jin and Shengyi Qian and Andrew Owens and David F. Fouhey},
      booktitle = {ICCV},
      year={2021}
}

Acknowledgment

We thank Dandan Shan, Mohamed El Banani, Nilesh Kulkarni, Richard Higgins for helpful discussions. Toyota Research Institute ("TRI") provided funds to assist the authors with their research but this article solely reflects the opinions and conclusions of its authors and not TRI or any other Toyota entity.

[ICCV 2021 (oral)] Planar Surface Reconstruction from Sparse Views

Related tags

Overview

Planar Surface Reconstruction From Sparse Views

Linyi Jin, Shengyi Qian, Andrew Owens, David F. Fouhey
University of Michigan
ICCV 2021 (Oral)

Usage Instructions

Citation

Acknowledgment

Owner

Linyi Jin

Provably Rare Gem Miner.

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

Code and data accompanying our SVRHM'21 paper.

NAS-FCOS: Fast Neural Architecture Search for Object Detection (CVPR 2020)

Distributed Evolutionary Algorithms in Python

Pytorch based library to rank predicted bounding boxes using text/image user's prompts.

Implementation of "Large Steps in Inverse Rendering of Geometry"

A flexible ML framework built to simplify medical image reconstruction and analysis experimentation.

YOLOv5🚀 reproduction by Guo Quanhao using PaddlePaddle

Augmented Traffic Control: A tool to simulate network conditions

Cross-Image Region Mining with Region Prototypical Network for Weakly Supervised Segmentation

EXplainable Artificial Intelligence (XAI)

CryptoFrog - My First Strategy for freqtrade

Datasets and pretrained Models for StyleGAN3 ...

A tutorial on training a DarkNet YOLOv4 model for the CrowdHuman dataset

Autonomous Movement from Simultaneous Localization and Mapping

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, wav2lip, picture repair, image editing, photo2cartoon, image style transfer, and so on.

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

This is Official implementation for "Pose-guided Feature Disentangling for Occluded Person Re-Identification Based on Transformer" in AAAI2022

An Intelligent Self-driving Truck System For Highway Transportation

[ICCV 2021 (oral)] Planar Surface Reconstruction from Sparse Views

Related tags

Overview

Planar Surface Reconstruction From Sparse Views

Linyi Jin, Shengyi Qian, Andrew Owens, David F. Fouhey University of Michigan ICCV 2021 (Oral)

Usage Instructions

Citation

Acknowledgment

Owner

Linyi Jin

Provably Rare Gem Miner.

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

Code and data accompanying our SVRHM'21 paper.

NAS-FCOS: Fast Neural Architecture Search for Object Detection (CVPR 2020)

Distributed Evolutionary Algorithms in Python

Pytorch based library to rank predicted bounding boxes using text/image user's prompts.

Implementation of "Large Steps in Inverse Rendering of Geometry"

A flexible ML framework built to simplify medical image reconstruction and analysis experimentation.

YOLOv5🚀 reproduction by Guo Quanhao using PaddlePaddle

Augmented Traffic Control: A tool to simulate network conditions

Cross-Image Region Mining with Region Prototypical Network for Weakly Supervised Segmentation

EXplainable Artificial Intelligence (XAI)

CryptoFrog - My First Strategy for freqtrade

Datasets and pretrained Models for StyleGAN3 ...

A tutorial on training a DarkNet YOLOv4 model for the CrowdHuman dataset

Autonomous Movement from Simultaneous Localization and Mapping

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, wav2lip, picture repair, image editing, photo2cartoon, image style transfer, and so on.

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

This is Official implementation for "Pose-guided Feature Disentangling for Occluded Person Re-Identification Based on Transformer" in AAAI2022

An Intelligent Self-driving Truck System For Highway Transportation

Linyi Jin, Shengyi Qian, Andrew Owens, David F. Fouhey
University of Michigan
ICCV 2021 (Oral)