Monocular Depth Estimation - Weighted-average prediction from multiple pre-trained depth estimation models

Last update: Nov 21, 2022

Overview

merged_depth runs (1) AdaBins, (2) DiverseDepth, (3) MiDaS, (4) SGDepth, and (5) Monodepth2, and calculates a weighted-average per-pixel absolute depth estimation.

Quick Start

First, download the pretrained models using the download_models script.

Next, run the infer script - this will run on all images in test/input and save the results to test/output.

python3 -m pip install -r requirements.txt
python3 -m merged_depth.utils.download_models
python3 -m merged_depth.infer

The results include (1) a _depth.npy file that you can load (see load_and_display_depth.py), (2) a _stacked.png file that shows the original and colorized depth images.

To run the predictor on a single input, use infer_single.py

python3 -m merged_depth.infer_single ~/foo/bar/test.png

Sample Output

Owner

Pranav

GitHub Repository

Unofficial Implementation of MLP-Mixer, Image Classification Model

MLP-Mixer Unoffical Implementation of MLP-Mixer, easy to use with terminal. Train and test easly. https://arxiv.org/abs/2105.01601 MLP-Mixer is an arc

6 Dec 05, 2022

Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes

Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized C

139 Jan 04, 2023

Kinetics-Data-Preprocessing

Kinetics-Data-Preprocessing Kinetics-400 and Kinetics-600 are common video recognition datasets used by popular video understanding projects like Slow

7 Oct 27, 2022

This is the official Pytorch implementation of the paper "Diverse Motion Stylization for Multiple Style Domains via Spatial-Temporal Graph-Based Generative Model"

Diverse Motion Stylization (Official) This is the official Pytorch implementation of this paper. Diverse Motion Stylization for Multiple Style Domains

28 Dec 16, 2022

Monocular Depth Estimation - Weighted-average prediction from multiple pre-trained depth estimation models

Related tags

Overview

Quick Start

Sample Output

Owner

Pranav

Unofficial Implementation of MLP-Mixer, Image Classification Model

Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes

Kinetics-Data-Preprocessing

This is the official Pytorch implementation of the paper "Diverse Motion Stylization for Multiple Style Domains via Spatial-Temporal Graph-Based Generative Model"

PyTorch implementation of PSPNet

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

Official repo for SemanticGAN https://nv-tlabs.github.io/semanticGAN/

Official implementation of our CVPR2021 paper "OTA: Optimal Transport Assignment for Object Detection" in Pytorch.

Exe-to-xlsm - Simple script to create VBscript of exe and inject to xlsm

3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Normal Face Photos

i-SpaSP: Structured Neural Pruning via Sparse Signal Recovery

Official code repository for the EMNLP 2021 paper

C3D is a modified version of BVLC caffe to support 3D ConvNets.

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives

Single Image Super-Resolution (SISR) with SRResNet, EDSR and SRGAN

A real world application of a Recurrent Neural Network on a binary classification of time series data

The code for Bi-Mix: Bidirectional Mixing for Domain Adaptive Nighttime Semantic Segmentation

DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)

Official implementation of the PICASO: Permutation-Invariant Cascaded Attentional Set Operator

Deep Markov Factor Analysis (NeurIPS2021)