Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Last update: Jan 01, 2023

Related tags

Deep Learning GLPDepth

Overview

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Downloads

[Downloads] Trained ckpt files for NYU Depth V2 and KITTI
[Downloads] Predicted depth maps png files for NYU Depth V2 and KITTI Eigen split test set

Requirements

Tested on

python==3.7.7
torch==1.6.0
h5py==3.6.0
scipy==1.7.3
opencv-python==4.5.5
mmcv==1.4.3
timm=0.5.4
albumentations=1.1.0
tensorboardX==2.4.1

You can install above package with

$ pip install -r requirements.txt

Inference and Evaluate

Dataset

NYU Depth V2

$ cd ./datasets
$ wget http://horatio.cs.nyu.edu/mit/silberman/nyu_depth_v2/nyu_depth_v2_labeled.mat
$ python ../code/utils/extract_official_train_test_set_from_mat.py nyu_depth_v2_labeled.mat splits.mat ./nyu_depth_v2/official_splits/

KITTI

Download annotated depth maps data set (14GB) from [link] into ./datasets/kitti/data_depth_annotated

$ cd ./datasets/kitti/data_depth_annotated/
$ unzip data_depth_annotated.zip

With above two instrtuctions, you can perform eval_with_pngs.py/test.py for NYU Depth V2 and eval_with_pngs for KITTI.

To fully perform experiments, please follow [BTS] repository to obtain full dataset for NYU Depth V2 and KITTI datasets.

Your dataset directory should be

root
- nyu_depth_v2
  - bathroom_0001
  - bathroom_0002
  - ...
  - official_splits
- kitti
  - data_depth_annotated
  - raw_data
  - val_selection_cropped

Evaluation

Evaluate with png images

for NYU Depth V2

$ python ./code/eval_with_pngs.py --dataset nyudepthv2 --pred_path ./best_nyu_preds/ --gt_path ./datasets/nyu_depth_v2/ --max_depth_eval 10.0

for KITTI

$ python ./code/eval_with_pngs.py --dataset kitti --split eigen_benchmark --pred_path ./best_kitti_preds/ --gt_path ./datasets/kitti/ --max_depth_eval 80.0 --garg_crop

Evaluate with model (NYU Depth V2)

Result images will be saved in ./args.result_dir/args.exp_name (default: ./results/test)

To evaluate only

$ python ./code/test.py --dataset nyudepthv2 --data_path ./datasets/ --ckpt_dir 
       
         --do_evaluate  --max_depth 10.0 --max_depth_eval 10.0

To save pngs for eval_with_pngs

$ python ./code/test.py --dataset nyudepthv2 --data_path ./datasets/ --ckpt_dir 
       
         --save_eval_pngs  --max_depth 10.0 --max_depth_eval 10.0

To save visualized depth maps

$ python ./code/test.py --dataset nyudepthv2 --data_path ./datasets/ --ckpt_dir 
       
         --save_visualize  --max_depth 10.0 --max_depth_eval 10.0

In case of kitti, modify arguments to --dataset kitti --max_depth 80.0 --max_depth_eval 80.0 and add --kitti_crop [garg_crop or eigen_crop]

Inference

Inference with image directory

$ python ./code/test.py --dataset imagepath --data_path 
     
       --save_visualize

To-Do

Add inference
Add training codes
Add dockerHub link
Add colab

References

[1] From Big to Small: Multi-Scale Local Planar Guidance for Monocular Depth Estimation. [code]

[2] SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. [code]

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Related tags

Overview

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Downloads

Requirements

Inference and Evaluate

Dataset

NYU Depth V2

KITTI

Evaluation

Inference

To-Do

References

Owner

Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning, CVPR 2021

Implements Gradient Centralization and allows it to use as a Python package in TensorFlow

Model Zoo for MindSpore

Multi-Target Adversarial Frameworks for Domain Adaptation in Semantic Segmentation

Walk with fastai

Code for our paper Domain Adaptive Semantic Segmentation with Self-Supervised Depth Estimation

An Straight Dilated Network with Wavelet for image Deblurring

This repository contains the files for running the Patchify GUI.

BookMyShowPC - Movie Ticket Reservation App made with Tkinter

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

The codes for the work "Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation"

NCVX (NonConVeX): A User-Friendly and Scalable Package for Nonconvex Optimization in Machine Learning.

PyTorch implementation for our NeurIPS 2021 Spotlight paper "Long Short-Term Transformer for Online Action Detection".

Automatic 2D-to-3D Video Conversion with CNNs

Artificial intelligence technology inferring issues and logically supporting facts from raw text

DISTIL: Deep dIverSified inTeractIve Learning.

Implementation of BI-RADS-BERT & The Advantages of Section Tokenization.

clDice - a Novel Topology-Preserving Loss Function for Tubular Structure Segmentation

Huawei Hackathon 2021 - Sweden (Stockholm)

This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in the Wild"