Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

Last update: Dec 13, 2022

Related tags

Deep Learning DistDepth

Overview

Toward Practical Monocular Indoor Depth Estimation

Cho-Ying Wu, Jialiang Wang, Michael Hall, Ulrich Neumann, Shuochen Su

[arXiv] [project site]

DistDepth

Our DistDepth is a highly robust monocular depth estimation approach for generic indoor scenes.

Trained with stereo sequences without their groundtruth depth
Structured and metric-accurate
Run in an interactive rate with Laptop GPU
Sim-to-real: trained on simulation and becomes transferrable to real scenes

Single Image Inference Demo

We test on Ubuntu 20.04 LTS with an laptop NVIDIA 2080 GPU (only GPU mode is supported).

Install packages

Use conda

conda create --name distdepth python=3.8 conda activate distdepth
Install pre-requisite common packages. Go to https://pytorch.org/get-started/locally/ and install pytorch that is compatible to your computer. We test on pytorch v1.9.0 and cudatoolkit-11.1. (The codes should work under other v1.0+ versions)

conda install pytorch==1.9.0 torchvision==0.10.0 torchaudio==0.9.0 cudatoolkit=11.3 -c pytorch -c conda-forge
Install other dependencies: opencv-python and matplotlib.

pip install opencv-python, matplotlib

Download pretrained models

Download pretrained models [here] (ResNet152, 246MB).
Move the downloaded item under this folder, and then unzip it. You should be able to see a new folder 'ckpts' that contains the pretrained models.
Run

python demo.py
Results will be stored under results/

Data

Download SimSIN [here]. For UniSIN and VA, please download at the [project site].

Depth-aware AR effects

Virtual object insertion:

Dragging objects along a trajectory:

Citation

@inproceedings{wu2022toward,
title={Toward Practical Monocular Indoor Depth Estimation},
author={Wu, Cho-Ying and Wang, Jialiang and Hall, Michael and Neumann, Ulrich and Su, Shuochen},
booktitle={CVPR},
year={2022}
}

License

DistDepth is CC-BY-NC licensed, as found in the LICENSE file.

Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

Related tags

Overview

Toward Practical Monocular Indoor Depth Estimation

DistDepth

Single Image Inference Demo

Data

Depth-aware AR effects

Citation

License

Owner

Meta Research

Libraries, tools and tasks created and used at DeepMind Robotics.

Advanced Deep Learning with TensorFlow 2 and Keras (Updated for 2nd Edition)

Tree Nested PyTorch Tensor Lib

Differentiable Abundance Matching With Python

Automatic learning-rate scheduler

A strongly-typed genetic programming framework for Python

Heterogeneous Temporal Graph Neural Network

Model Quantization Benchmark

Raptor-Multi-Tool - Raptor Multi Tool With Python

IhoneyBakFileScan Modify - 批量网站备份文件扫描器，增加文件规则，优化内存占用

Privacy-Preserving Portrait Matting [ACM MM-21]

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it

Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

Read and write layered TIFF ImageSourceData and ImageResources tags

Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation (ICCV 2021)

HINet: Half Instance Normalization Network for Image Restoration

Code for reproducing key results in the paper "InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets"

Some tentative models that incorporate label propagation to graph neural networks for graph representation learning in nodes, links or graphs.

Implementation of "JOKR: Joint Keypoint Representation for Unsupervised Cross-Domain Motion Retargeting"

StarGAN-ZSVC: Unofficial PyTorch Implementation