Python scripts for performing stereo depth estimation using the HITNET model in TensorFlow Lite.

Last update: Oct 20, 2022

Overview

TFLite-HITNET-Stereo-depth-estimation

Stereo depth estimation on the cones images from the Middlebury dataset (https://vision.middlebury.edu/stereo/data/scenes2003/)

Requirements

OpenCV, imread-from-url, and either tensorflow==2.6.0 or tflite_runtime. pafy and youtube-dl are also required for YouTube video inference.

Installation

pip install -r requirements.txt
pip install pafy youtube-dl

For the TFLite runtime, you can use either the full TensorFlow package (make sure it is version 2.6.0 or above, for example pip install tensorflow==2.6.0) or a TensorFlow Lite runtime binary (tflite_runtime).
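The choice between the two runtimes can be checked from Python before running the scripts. The following is a minimal sketch, not the repository's own code: the helper names pick_tflite_backend and load_interpreter_class are hypothetical, and the sketch simply prefers the smaller tflite_runtime package when both are installed.

```python
import importlib.util


def pick_tflite_backend():
    """Return the name of the first installed TFLite backend, or None.

    Hypothetical helper for illustration; not part of this repository.
    """
    # Prefer the small standalone runtime, then fall back to full TensorFlow.
    for package in ("tflite_runtime", "tensorflow"):
        if importlib.util.find_spec(package) is not None:
            return package
    return None


def load_interpreter_class():
    """Import and return the Interpreter class from the available backend."""
    backend = pick_tflite_backend()
    if backend == "tflite_runtime":
        from tflite_runtime.interpreter import Interpreter
        return Interpreter
    if backend == "tensorflow":
        import tensorflow as tf
        return tf.lite.Interpreter
    raise ImportError("Install tensorflow>=2.6.0 or tflite_runtime first.")


print("TFLite backend found:", pick_tflite_backend())
```

The imports are kept inside load_interpreter_class so that merely checking which backend is present does not pay the cost of importing TensorFlow.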

Known issues

On computers with a GPU, the program would crash silently, without any error, during inference. As a workaround, os.environ["CUDA_VISIBLE_DEVICES"]="-1" is added at the beginning of the script to force the program to run on the CPU. You can comment out this line for other types of devices.
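As a minimal sketch of that workaround, the variable should be set before TensorFlow initializes its devices. Placing the assignment ahead of the import is the simplest way to guarantee that. The commented-out import below is only a placeholder showing where the framework import would go.

```python
import os

# Hide all CUDA devices so that inference falls back to the CPU.
# Set this before TensorFlow is imported, since the GPU runtime reads
# the variable when it initializes.
os.environ["CUDA_VISIBLE_DEVICES"] = "-1"

# import tensorflow as tf  # import only after the variable is set

print("CUDA_VISIBLE_DEVICES =", os.environ["CUDA_VISIBLE_DEVICES"])
```

Commenting out the assignment restores the default device selection.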

tflite model

The original models were converted to different formats (including .tflite) by PINTO0309. Download the models from his repository and save them into the models folder.
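Before running the scripts, it can help to confirm that the downloaded files ended up in the right place. The sketch below is an illustration under assumptions: list_tflite_models is a hypothetical helper, and it assumes the .tflite files sit directly inside the models folder rather than in subfolders.

```python
from pathlib import Path


def list_tflite_models(models_dir="models"):
    """Return the sorted .tflite files found directly inside models_dir.

    Hypothetical helper for illustration; not part of this repository.
    """
    folder = Path(models_dir)
    if not folder.is_dir():
        return []
    return sorted(folder.glob("*.tflite"))


found = list_tflite_models()
if found:
    for model_path in found:
        print("Found model:", model_path.name)
else:
    print("No .tflite files found in the models folder.")
```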

Original Tensorflow model

The Tensorflow pretrained model was taken from the original repository.

Examples

Image inference:

python imageDepthEstimation.py

Video inference:

python videoDepthEstimation.py

DrivingStereo dataset inference:

python drivingStereoTest.py

Tensorflow inference

For performing the inference in Tensorflow, check my other repository HITNET Stereo Depth estimation.

ONNX inference

For performing the inference in ONNX, check my other repository ONNX HITNET Stereo Depth estimation.

Inference video Example Raspberry Pi 4

References:

Hitnet model: https://github.com/google-research/google-research/tree/master/hitnet
PINTO0309's model zoo: https://github.com/PINTO0309/PINTO_model_zoo
PINTO0309's model conversion tool: https://github.com/PINTO0309/openvino2tensorflow
DrivingStereo dataset: https://drivingstereo-dataset.github.io/
Original paper: https://arxiv.org/abs/2007.12140

Owner

Ibai Gorordo