The official code for the ICRA 2021 paper: "Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation"

Last update: Jul 27, 2022

Overview

G2S

This is the official code for ICRA 2021 Paper: Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation by Hemang Chawla, Arnav Varma, Elahe Arani and Bahram Zonooz.

G2S (GPS-to-Scale) loss is a dynamically weighted loss that can be added to the appearance-based losses when training any monocular self-supervised depth estimation architecture, so that the model produces scale-consistent and scale-aware depth estimates at inference.
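As a rough illustration of the idea, the sketch below penalizes the difference between the magnitude of a predicted inter-frame translation and the magnitude of the GPS displacement over the same frame pair, and adds that term to an appearance loss with an epoch-dependent weight. It is a minimal pure-Python sketch, not the G2S class in lossG2S.py: the function names, the squared-difference form, and the weighting schedule exp(epoch - num_epochs) are all assumptions made for illustration, and a real training loop would use tensor operations instead of tuples.

```python
import math


def translation_norm(translation):
    """Euclidean length of a 3D translation given as an (x, y, z) tuple."""
    return math.sqrt(sum(component * component for component in translation))


def g2s_weight(epoch, num_epochs):
    """Hypothetical dynamic weight: grows exponentially and reaches 1.0 at the last epoch."""
    return math.exp(epoch - num_epochs)


def g2s_loss(pred_translations, gps_translations):
    """Mean squared difference between predicted and GPS translation magnitudes."""
    if len(pred_translations) != len(gps_translations):
        raise ValueError("expected one GPS translation per predicted translation")
    errors = [
        (translation_norm(gps) - translation_norm(pred)) ** 2
        for pred, gps in zip(pred_translations, gps_translations)
    ]
    return sum(errors) / len(errors)


def total_loss(appearance_loss, pred_translations, gps_translations, epoch, num_epochs):
    """Appearance loss plus the dynamically weighted scale term."""
    weight = g2s_weight(epoch, num_epochs)
    return appearance_loss + weight * g2s_loss(pred_translations, gps_translations)
```

Only magnitudes are compared in this sketch, so the GPS and camera frames do not need to be aligned; the scale term simply pulls the predicted translation length toward the metric distance travelled.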

Here, we provide a helper GPS dataloader class and the G2S loss class so that this loss can be used with any model.

For details, please see the Paper and Presentation.

KITTI GPS

The GPS files, which contain the geodesic GPS information of the raw KITTI dataset in local coordinates for training with the G2S loss, can be found in the assets folder as kitti_gps_raw.zip.
Unzip the file at /path/to/KITTI/raw_data/sync to merge the GPS files into the expected directory tree structure.
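The extraction step can also be scripted. The sketch below uses Python's standard zipfile module; the helper name extract_gps_archive is hypothetical, and the paths in the usage comment are the placeholders from the text above, to be replaced with your own locations.

```python
import zipfile
from pathlib import Path


def extract_gps_archive(archive_path, kitti_sync_root):
    """Extract the GPS archive into an existing KITTI sync directory.

    Returns the list of archive member names that were extracted.
    """
    root = Path(kitti_sync_root)
    if not root.is_dir():
        # Fail early instead of silently creating a new, empty dataset tree.
        raise FileNotFoundError(f"KITTI sync directory not found: {root}")
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(root)
        return archive.namelist()


# Example call (placeholder paths):
# extract_gps_archive("assets/kitti_gps_raw.zip", "/path/to/KITTI/raw_data/sync")
```

Extracting into the existing directory merges the archive contents with the folders already there, which matches the merge behaviour described above.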

Usage

You can use the G2S class in lossG2S.py within your project for scale-consistent and scale-aware predictions. This requires using the co-present GPS modality along with the images. To load the GPS data, please adapt the GPSDataloader class in dataloaderGPS.py into your image dataloader.
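To make the integration concrete, the sketch below shows one way an image dataset could be paired with GPS positions so that each sample also carries the GPS displacement to its neighbouring frames. It is not the repository's GPSDataloader: the class name ImagesWithGPS, the dictionary keys, the assumption that each image sample is a dict, and the assumption that all frames belong to a single sequence with one local (x, y, z) position per frame are all hypothetical.

```python
class ImagesWithGPS:
    """Hypothetical wrapper that attaches GPS displacements to image samples."""

    def __init__(self, image_dataset, gps_positions, frame_offsets=(-1, 1)):
        if len(image_dataset) != len(gps_positions):
            raise ValueError("expected one GPS position per image sample")
        self.image_dataset = image_dataset
        self.gps_positions = gps_positions  # one local (x, y, z) tuple per frame
        self.frame_offsets = frame_offsets  # neighbouring frames used for training

    def __len__(self):
        return len(self.image_dataset)

    def __getitem__(self, index):
        # Copy the image sample so the wrapped dataset is left untouched.
        sample = dict(self.image_dataset[index])
        here = self.gps_positions[index]
        for offset in self.frame_offsets:
            neighbour = index + offset
            if 0 <= neighbour < len(self):
                there = self.gps_positions[neighbour]
                # Displacement from the current frame to the neighbouring frame.
                sample[("gps_translation", offset)] = tuple(
                    b - a for a, b in zip(here, there)
                )
        return sample
```

The displacement stored for each offset is the quantity a scale loss would compare against the translation predicted by the pose network for the same frame pair.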

Cite Our Work

If you find the code useful in your research, please consider citing our paper:

@inproceedings{chawlavarma2021multimodal,
	author={H. {Chawla} and A. {Varma} and E. {Arani} and B. {Zonooz}},
	booktitle={2021 IEEE International Conference on Robotics and Automation (ICRA)},
	title={Multimodal Scale Consistency and Awareness for Monocular Self-Supervised
	Depth Estimation},
	location={Xi’an, China},
	publisher={IEEE (in press)},
	year={2021}
}

License

This project is licensed under the terms of the MIT license.

Owner

NeurAI
