Python scripts for performing stereo depth estimation using the MobileStereoNet model in ONNX

Last update: Nov 29, 2022

Related tags

Overview

ONNX-MobileStereoNet

Python scripts for performing stereo depth estimation using the MobileStereoNet model in ONNX

Stereo depth estimation on the cones images from the Middlebury dataset (https://vision.middlebury.edu/stereo/data/scenes2003/)

Requirements

Check the requirements.txt file. Additionally, pafy and youtube-dl are required for youtube video inference.

Installation

pip install -r requirements.txt
pip install pafy youtube-dl

ONNX model

The original models were converted to different formats (including .onnx) by PINTO0309, the models can be found in his repository.

Original Pytorch model

The Pytorch pretrained model was taken from the original repository.

Examples

Image inference:

python image_depth_estimation.py

Video inference:

python video_depth_estimation.py

DrivingStereo dataset inference:

python driving_sereo_test.py

Inference video Example

References:

MobileStereoNet model: https://github.com/cogsys-tuebingen/mobilestereonet
PINTO0309's model zoo: https://github.com/PINTO0309/PINTO_model_zoo
PINTO0309's model conversion tool: https://github.com/PINTO0309/openvino2tensorflow
DrivingStereo dataset: https://drivingstereo-dataset.github.io/
Original paper: https://arxiv.org/pdf/2108.09770.pdf

Python scripts for performing stereo depth estimation using the MobileStereoNet model in ONNX

Related tags

Overview

ONNX-MobileStereoNet

Requirements

Installation

ONNX model

Original Pytorch model

Examples

Inference video Example

References:

Owner

Ibai Gorordo

Official Code Release for Container : Context Aggregation Network

Repository for "Improving evidential deep learning via multi-task learning," published in AAAI2022

Thermal Control of Laser Powder Bed Fusion using Deep Reinforcement Learning

Gif-caption - A straightforward GIF Captioner written in Python

LSTMs (Long Short Term Memory) RNN for prediction of price trends

PyTorch implementation for Convolutional Networks with Adaptive Inference Graphs

Implementation of the paper ''Implicit Feature Refinement for Instance Segmentation''.

QuadTree Attention for Vision Transformers (ICLR2022)

BLEURT is a metric for Natural Language Generation based on transfer learning.

A framework for Quantification written in Python

An Intelligent Self-driving Truck System For Highway Transportation

TC-GNN with Pytorch integration

A vision library for performing sliced inference on large images/small objects

unet-family: Ultimate version

CoReNet is a technique for joint multi-object 3D reconstruction from a single RGB image.

Supplementary code for SIGGRAPH 2021 paper: Discovering Diverse Athletic Jumping Strategies

Improving Factual Consistency of Abstractive Text Summarization

Hypernetwork-Ensemble Learning of Segmentation Probability for Medical Image Segmentation with Ambiguous Labels

ComPhy: Compositional Physical Reasoning ofObjects and Events from Videos

[ICCV'21] Neural Radiance Flow for 4D View Synthesis and Video Processing