Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

Last update: Dec 13, 2022

Related tags

Deep Learning DistDepth

Overview

Toward Practical Monocular Indoor Depth Estimation

Cho-Ying Wu, Jialiang Wang, Michael Hall, Ulrich Neumann, Shuochen Su

[arXiv] [project site]

DistDepth

Our DistDepth is a highly robust monocular depth estimation approach for generic indoor scenes.

Trained with stereo sequences without their groundtruth depth
Structured and metric-accurate
Run in an interactive rate with Laptop GPU
Sim-to-real: trained on simulation and becomes transferrable to real scenes

Single Image Inference Demo

We test on Ubuntu 20.04 LTS with an laptop NVIDIA 2080 GPU (only GPU mode is supported).

Install packages

Use conda

conda create --name distdepth python=3.8 conda activate distdepth
Install pre-requisite common packages. Go to https://pytorch.org/get-started/locally/ and install pytorch that is compatible to your computer. We test on pytorch v1.9.0 and cudatoolkit-11.1. (The codes should work under other v1.0+ versions)

conda install pytorch==1.9.0 torchvision==0.10.0 torchaudio==0.9.0 cudatoolkit=11.3 -c pytorch -c conda-forge
Install other dependencies: opencv-python and matplotlib.

pip install opencv-python, matplotlib

Download pretrained models

Download pretrained models [here] (ResNet152, 246MB).
Move the downloaded item under this folder, and then unzip it. You should be able to see a new folder 'ckpts' that contains the pretrained models.
Run

python demo.py
Results will be stored under results/

Data

Download SimSIN [here]. For UniSIN and VA, please download at the [project site].

Depth-aware AR effects

Virtual object insertion:

Dragging objects along a trajectory:

Citation

@inproceedings{wu2022toward,
title={Toward Practical Monocular Indoor Depth Estimation},
author={Wu, Cho-Ying and Wang, Jialiang and Hall, Michael and Neumann, Ulrich and Su, Shuochen},
booktitle={CVPR},
year={2022}
}

License

DistDepth is CC-BY-NC licensed, as found in the LICENSE file.

Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

Related tags

Overview

Toward Practical Monocular Indoor Depth Estimation

DistDepth

Single Image Inference Demo

Data

Depth-aware AR effects

Citation

License

Owner

Meta Research

Simple, but essential Bayesian optimization package

PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces

[ICCV'21] Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment

CRF-RNN for Semantic Image Segmentation - PyTorch version

When are Iterative GPs Numerically Accurate?

A Python library for adversarial machine learning focusing on benchmarking adversarial robustness.

TDN: Temporal Difference Networks for Efficient Action Recognition

Neural models of common sense. 🤖

A curated list of the top 10 computer vision papers in 2021 with video demos, articles, code and paper reference.

Code for the Paper: Alexandra Lindt and Emiel Hoogeboom.

This is an official implementation of the High-Resolution Transformer for Dense Prediction.

UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model

[arXiv] What-If Motion Prediction for Autonomous Driving ❓🚗💨

Segmentation for medical image.

Collect super-resolution related papers, data, repositories

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

Low Complexity Channel estimation with Neural Network Solutions

A library for Deep Learning Implementations and utils

Toward Spatially Unbiased Generative Models (ICCV 2021)

Unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"