Code and models for "Pano3D: A Holistic Benchmark and a Solid Baseline for 360 Depth Estimation", OmniCV Workshop @ CVPR21.

Last update: Dec 29, 2022

Overview

Pano3D

A Holistic Benchmark and a Solid Baseline for 360^o Depth Estimation

Pano3D is a new benchmark for depth estimation from spherical panoramas. We generate a dataset (using GibsonV2) and provide baselines for holistic performance assessment, offering:

Primary and secondary traits metrics:
- Direct depth performance:
  - (w)RMSE
  - (w)RMSLE
  - AbsRel
  - SqRel
  - (w)Relative accuracy (\delta) @ {1.05, 1.1, 1.25, 1.25², 1.25³ }
- Boundary discontinuity preservation:
  - Precision @ {0.25, 0.5, 1.0}m
  - Recall @ {0.25, 0.5, 1.0}m
  - Depth boundary errors of accuracy and completeness
- Surface smoothness:
  - RMSE^o
  - Relative accuracy (\alpha) @ {11.25^o, 22.5^o, 30^o}
Out-of-distribution & Zero-shot cross dataset transfer:
- Different depth distribution test set
- Varying scene context test set
- Shifted camera domain test set

By disentangling generalization and assessing all depth properties, Pano3D aspires to drive progress benchmarking for 360^o depth estimation.

Using Pano3D to search for a solid baseline results in an acknowledgement of exploiting complementary error terms, adding encoder-decoder skip connections and using photometric augmentations.

TODO

Demo

A publicly hosted demo of the baseline models can be found here. Using the web app, it is possible to upload a panorama and download a 3D reconstructed mesh of the scene using the derived depth map.

Note that due to the external host's caching issues, it might be necessary to refresh your browser's cache in between runs to update the 3D models.

Data

Download

To download the data, follow the instructions at vcl3d.github.io/Pano3D/download/.

Please note that getting access to the data download links is a two step process as the dataset is a derivative and compliance with the original dataset's terms and usage agreements is required. Therefore:

You first need to fill in this Google Form.
And, then, you need to perform an access request at each one of the Zenodo repositories (depending on which dataset partition you need):

After both these steps are completed, you will soon receive the download links for each dataset partition.

Code and models for "Pano3D: A Holistic Benchmark and a Solid Baseline for 360 Depth Estimation", OmniCV Workshop @ CVPR21.

Related tags

Overview

Pano3D

A Holistic Benchmark and a Solid Baseline for 360o Depth Estimation

TODO

Demo

Data

Download

Loader

Splits

Models

Download

Inference

Serve

Metrics

Direct

Boundary

Smoothness

Results

Owner

Visual Computing Lab, Information Technologies Institute, Centre for Reseach and Technology Hellas

[CVPR2021 Oral] FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation.

Using VideoBERT to tackle video prediction

Repo for my Tensorflow/Keras CV experiments. Mostly revolving around the Danbooru20xx dataset

Official PyTorch implementation of Retrieve in Style: Unsupervised Facial Feature Transfer and Retrieval.

The PASS dataset: pretrained models and how to get the data - PASS: Pictures without humAns for Self-Supervised Pretraining

Joint project of the duo Hacker Ninjas

This provides the R code and data to replicate results in "The USS Trustee’s risky strategy"

Defending against Model Stealing via Verifying Embedded External Features

Unofficial implement with paper SpeakerGAN: Speaker identification with conditional generative adversarial network

Deep-learning X-Ray Micro-CT image enhancement, pore-network modelling and continuum modelling

Voice Conversion Using Speech-to-Speech Neuro-Style Transfer

Edge Restoration Quality Assessment

Escaping the Gradient Vanishing: Periodic Alternatives of Softmax in Attention Mechanism

Repository containing the PhD Thesis "Formal Verification of Deep Reinforcement Learning Agents"

Introduction to AI assignment 1 HCM University of Technology, term 211

Hydra: an Extensible Fuzzing Framework for Finding Semantic Bugs in File Systems

Final term project for Bayesian Machine Learning Lecture (XAI-623)

Code release for DS-NeRF (Depth-supervised Neural Radiance Fields)

Semantic Segmentation Suite in TensorFlow

[CIKM 2019] Code and dataset for "Fi-GNN: Modeling Feature Interactions via Graph Neural Networks for CTR Prediction"

A Holistic Benchmark and a Solid Baseline for 360^o Depth Estimation