Code repo for "Towards Interpretable Deep Networks for Monocular Depth Estimation" paper.

Last update: Aug 12, 2022

InterpretableMDE

A PyTorch implementation for "Towards Interpretable Deep Networks for Monocular Depth Estimation" paper.

arXiv link: https://arxiv.org/abs/2108.05312

Data and Model

For MFF models, we use the dataset released by the MFF authors here, and you can download their pretrained models to use as baselines here. The BTS models use a different set of NYUv2 training images (24,231 instead of 50,688), which you can download here. All of our models are available here.

Evaluation

This project uses yacs to manage configurations. To evaluate a model, for example the MFF model with a SENet backbone using our assigning method, run

python eval.py MODEL_WEIGHTS_FILE [PATH_TO_MODEL/mff_senet_asn]

from the root directory.
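The command above passes configuration overrides as alternating KEY VALUE arguments, which is the convention yacs supports. As a rough illustration of that convention only, here is a standard-library sketch. The helper name parse_overrides is hypothetical and is not part of this repo or of yacs, and the actual scripts may handle their arguments differently.

```python
import ast


def parse_overrides(args):
    """Turn ['KEY', 'value', 'KEY2', 'value2'] into a dict of overrides.

    Hypothetical helper for illustration; not taken from this repo.
    """
    if len(args) % 2 != 0:
        raise ValueError("overrides must come in KEY VALUE pairs")
    overrides = {}
    for key, raw in zip(args[0::2], args[1::2]):
        try:
            # Interpret Python literals such as True, 3 or 0.5.
            value = ast.literal_eval(raw)
        except (ValueError, SyntaxError):
            # Anything else (paths, layer names) stays a plain string.
            value = raw
        overrides[key] = value
    return overrides


if __name__ == "__main__":
    # Placeholder path copied from the command above, not a real file.
    print(parse_overrides(["MODEL_WEIGHTS_FILE", "PATH_TO_MODEL/mff_senet_asn"]))
```

With this convention, a flag such as ON_TRAINING_DATA True arrives as a boolean rather than the string "True", while paths and layer names are left untouched.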

To evaluate the depth selectivity, run

python dissect.py MODEL_WEIGHTS_FILE [PATH_TO_MODEL/mff_senet_asn] LAYERS D_MFF ON_TRAINING_DATA True

This produces the depth selectivity and the dissection result of each unit. Layer names are separated by underscores (_).
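To give a feel for what a per-unit selectivity score can look like, here is a minimal sketch. It assumes a class-selectivity-style index computed from a unit's mean response in each depth bin, with non-negative responses. Both function names are hypothetical, and dissect.py may compute the score differently; see the paper for the exact definition.

```python
def split_layers(layers):
    """Split a LAYERS value such as 'D_MFF' into individual layer names."""
    return layers.split("_")


def depth_selectivity(mean_responses):
    """Selectivity of one unit from its mean response per depth bin.

    Assumed form: (r_max - r_rest) / (r_max + r_rest), where r_max is the
    largest per-bin mean and r_rest is the average over the other bins.
    For non-negative responses, 0 means no preference and 1 means the
    unit responds to a single depth bin only.
    """
    if len(mean_responses) < 2:
        raise ValueError("need at least two depth bins")
    r_max = max(mean_responses)
    best = mean_responses.index(r_max)
    others = [r for i, r in enumerate(mean_responses) if i != best]
    r_rest = sum(others) / len(others)
    denom = r_max + r_rest
    # Guard against a unit that never activates.
    return 0.0 if denom == 0 else (r_max - r_rest) / denom


if __name__ == "__main__":
    print(split_layers("D_MFF"))
    print(depth_selectivity([0.0, 4.0, 0.0]))
```

The index is scale-invariant: multiplying all responses of a unit by a constant leaves its score unchanged, which makes units from different layers comparable.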

Training

To train a model from scratch, run

python train.py MODEL_NAME MFF_resnet

We currently provide four options for MODEL_NAME. When a BTS model is selected, the training scheme automatically switches to match the original BTS training scheme.

Acknowledgement

The model part of our code is adapted from Revisiting_Single_Depth_Estimation and bts. Some snippets are adapted from monodepth2.

Bibtex

@inproceedings{you2021iccv,
 title = {Towards Interpretable Deep Networks for Monocular Depth Estimation},
 author = {Zunzhi You and Yi-Hsuan Tsai and Wei-Chen Chiu and Guanbin Li},
 booktitle = {International Conference on Computer Vision (ICCV)},
 year = {2021}
}


Owner

Zunzhi You
