PanopticBEV - Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images

Last update: Dec 16, 2022

Related tags

Overview

Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images

This repository contains the PyTorch implementation of the PanopticBEV model proposed in our RA-L 2021 paper Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images.

Our approach, PanopticBEV, is the state-of-the-art approach for generating panoptic segmentation maps in the bird's eye view using only monocular frontal view images.

If you find this code useful for your research, please consider citing our paper:

@article{gosala2021bev,
  title={Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images},
  author={Gosala, Nikhil and Valada, Abhinav},
  journal={arXiv preprint arXiv:2108.03227},
  year={2021}
}

Relevant links

System requirements

Linux (Tested on Ubuntu 18.04)
Python3 (Tested using Python 3.6.9)
PyTorch (Tested using PyTorch 1.8.1)
CUDA (Tested using CUDA 11.1)

Installation

a. Create a python virtual environment and activate it.

python3 -m venv panoptic_bev
source panoptic_bev/bin/activate

b. Update pip to the latest version.

python3 -m pip install --upgrade pip

c. Install the required python dependencies using the provided requirements.txt file.

pip3 install -r requirements.txt

d. Install the PanopticBEV code.

python3 setup.py develop

Obtaining the datasets

Please download the datasets from here and follow the instructions provided in the encapsulated readme file.

Code Execution

Configuration parameters

The configuration parameters of the model such as the learning rate, batch size, and dataloader options are stored in the experiments/config folder. If you intend to modify the model parameters, please do so here.

Training and Evaluation

The training and evaluation python codes along with the shell scripts to execute them are provided in the scripts folder. Before running the shell scripts, please fill in the missing parameters with your computer-specific data paths and parameters.

To train the model, execute the following command after replacing * with either kitti or nuscenes.

bash train_panoptic_bev_*.sh

To evaluate the model, execute the following command after replacing * with either kitti or nuscenes.

bash eval_panoptic_bev_*.sh

Acknowledgements

This work was supported by the Federal Ministry of Education and Research (BMBF) of Germany under ISA 4.0 and by the Eva Mayr-Stihl Stiftung.

This project contains code adapted from other open-source projects. We especially thank the authors of:

License

This code is released under the GPLv3 for academic usage. For commercial usage, please contact Nikhil Gosala.

PanopticBEV - Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images

Related tags

Overview

Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images

Relevant links

System requirements

Installation

Obtaining the datasets

Code Execution

Configuration parameters

Training and Evaluation

Acknowledgements

License

Owner

Tutorial page of the Climate Hack, the greatest hackathon ever

Diagnostic tests for linguistic capacities in language models

Ganilla - Official Pytorch implementation of GANILLA

ivadomed is an integrated framework for medical image analysis with deep learning.

This is the code for "HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields".

Generative Query Network (GQN) in PyTorch as described in "Neural Scene Representation and Rendering"

Devkit for 3D -- Some utils for 3D object detection based on Numpy and Pytorch

FactSeg: Foreground Activation Driven Small Object Semantic Segmentation in Large-Scale Remote Sensing Imagery (TGRS)

A study project using the AA-RMVSNet to reconstruct buildings from multiple images

VOLO: Vision Outlooker for Visual Recognition

This Deep Learning Model Predicts that from which disease you are suffering.

Robust Self-augmentation for NER with Meta-reweighting

Scribble-Supervised LiDAR Semantic Segmentation, CVPR 2022 (ORAL)

code for CVPR paper Zero-shot Instance Segmentation

Fully Convolutional DenseNets for semantic segmentation.

DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Deep learning models for classification of 15 common weeds in the southern U.S. cotton production systems.

Use evolutionary algorithms instead of gridsearch in scikit-learn

DiffStride: Learning strides in convolutional neural networks