Code repository for the paper "Towards Interpretable Deep Networks for Monocular Depth Estimation".

Last update: Aug 12, 2022


InterpretableMDE

A PyTorch implementation of the paper "Towards Interpretable Deep Networks for Monocular Depth Estimation".

arXiv link: https://arxiv.org/abs/2108.05312

Data and Model

For the MFF models, we use the dataset released by their authors here, and their pretrained models can be downloaded as baselines here. The BTS models use a different set of NYUv2 training images (24,231 instead of 50,688), which can be downloaded here. All of our models are available here.

Evaluation

In this project we use yacs to manage configurations. To evaluate a model, for example the MFF model with the SENet backbone trained with our assigning method, run

python eval.py MODEL_WEIGHTS_FILE [PATH_TO_MODEL/mff_senet_asn]

from the root directory.
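The commands in this README pass options as alternating KEY VALUE arguments, which is the style yacs supports through `CfgNode.merge_from_list`. The following is a minimal, stdlib-only sketch of that idea, not the repository's actual configuration code. The function name `merge_overrides` and the default values are hypothetical; only the key names are taken from the commands shown in this README.

```python
import ast


def merge_overrides(cfg, opts):
    """Return a copy of cfg updated from a flat [KEY, VALUE, KEY, VALUE, ...] list."""
    if len(opts) % 2 != 0:
        raise ValueError("overrides must come in KEY VALUE pairs")
    merged = dict(cfg)
    for key, raw in zip(opts[0::2], opts[1::2]):
        if key not in merged:
            raise KeyError(f"unknown config key: {key}")
        try:
            # Interpret literals such as True, 3 or 0.5; anything else stays a string.
            value = ast.literal_eval(raw)
        except (ValueError, SyntaxError):
            value = raw
        merged[key] = value
    return merged


# Hypothetical defaults: the real defaults live in the repository's config.
defaults = {"MODEL_WEIGHTS_FILE": "", "LAYERS": "", "ON_TRAINING_DATA": False}

# The same KEY VALUE layout as the command-line examples (placeholder path).
argv = [
    "MODEL_WEIGHTS_FILE", "PATH_TO_MODEL/mff_senet_asn",
    "LAYERS", "D_MFF",
    "ON_TRAINING_DATA", "True",
]
cfg = merge_overrides(defaults, argv)
```

Rejecting unknown keys mirrors the usual motivation for a schema-style config: a misspelled option fails loudly instead of being silently ignored.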

To evaluate the depth selectivity, run

python dissect.py MODEL_WEIGHTS_FILE [PATH_TO_MODEL/mff_senet_asn] LAYERS D_MFF ON_TRAINING_DATA True

to obtain the depth selectivity and the dissection result of each unit. Layer names are separated by underscores (_).
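As an illustration of the LAYERS format, the sketch below splits an underscore-separated value into individual layer names. It assumes that layer names themselves contain no underscore; `parse_layers` is a hypothetical helper and not a function from this repository.

```python
def parse_layers(spec):
    """Split an underscore-separated LAYERS value into individual layer names."""
    # Empty fragments (e.g. from a trailing underscore) are dropped.
    return [name for name in spec.split("_") if name]


layers = parse_layers("D_MFF")
print(layers)  # ['D', 'MFF']
```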

Training

To train a model from scratch, run

python train.py MODEL_NAME MFF_resnet

We currently provide four options for MODEL_NAME. When a BTS model is selected, the training scheme automatically switches to match the original BTS one.

Acknowledgement

The model part of our code is adapted from Revisiting_Single_Depth_Estimation and bts. Some snippets are adapted from monodepth2.

Bibtex

@inproceedings{you2021iccv,
 title = {Towards Interpretable Deep Networks for Monocular Depth Estimation},
 author = {Zunzhi You and Yi-Hsuan Tsai and Wei-Chen Chiu and Guanbin Li},
 booktitle = {International Conference on Computer Vision (ICCV)},
 year = {2021}
}


Owner

Zunzhi You
