Det3D

A general 3D Object Detection codebase in PyTorch.

1. Introduction

Det3D is the first 3D Object Detection toolbox which provides off the box implementations of many 3D object detection algorithms such as PointPillars, SECOND, PIXOR, etc, as well as state-of-the-art methods on major benchmarks like KITTI(ViP) and nuScenes(CBGS). Key features of Det3D include the following aspects:

Multi Datasets Support: KITTI, nuScenes, Lyft
Point-based and Voxel-based model zoo
State-of-the-art performance
DDP & SyncBN

2. Installation

Please refer to INSTALATION.md.

3. Quick Start

Please refer to GETTING_STARTED.md.

4. Model Zoo

4.1 nuScenes

	mAP	mATE	mASE	mAOE	mAVE	mAAE	NDS	ckpt
CBGS	49.9	0.335	0.256	0.323	0.251	0.197	61.3	link
PointPillar	41.8	0.363	0.264	0.377	0.288	0.198	56.0	link

The original model and prediction files are available in the CBGS README.

4.2 KITTI

Second on KITTI(val) Dataset

car  AP @0.70, 0.70,  0.70:
bbox AP:90.54, 89.35, 88.43
bev  AP:89.89, 87.75, 86.81
3d   AP:87.96, 78.28, 76.99
aos  AP:90.34, 88.81, 87.66

PointPillars on KITTI(val) Dataset

car  [email protected],  0.70,  0.70:
bbox AP:90.63, 88.86, 87.35
bev  AP:89.75, 86.15, 83.00
3d   AP:85.75, 75.68, 68.93
aos  AP:90.48, 88.36, 86.58

4.3 Lyft

Lyft Config

4.4 Waymo

5. Functionality

Models
- VoxelNet
- SECOND
- PointPillars
Features
- Multi task learning & Multi-task Learning
- Distributed Training and Validation
- SyncBN
- Flexible anchor dimensions
- TensorboardX
- Checkpointer & Breakpoint continue
- Self-contained visualization
- Finetune
- Multiscale Training & Validation
- Rotated RoI Align

6. TODO List

To Be Released
- CGBS on Lyft(val) Dataset
Models
- PointRCNN
- PIXOR

7. Call for contribution.

Support Waymo Dataset.
Add other 3D detection / segmentation models, such as VoteNet, STD, etc.

8. Developers

Benjin Zhu , Bingqi Ma

9. License

Det3D is released under the Apache licenes.

10. Citation

Det3D is a derivative codebase of CBGS, if you find this work useful in your research, please consider cite:

@article{zhu2019class,
  title={Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection},
  author={Zhu, Benjin and Jiang, Zhengkai and Zhou, Xiangxin and Li, Zeming and Yu, Gang},
  journal={arXiv preprint arXiv:1908.09492},
  year={2019}
}

A general 3D Object Detection codebase in PyTorch.

Related tags

Overview

Det3D

1. Introduction

2. Installation

3. Quick Start

4. Model Zoo

4.1 nuScenes

4.2 KITTI

Second on KITTI(val) Dataset

PointPillars on KITTI(val) Dataset

4.3 Lyft

4.4 Waymo

5. Functionality

6. TODO List

7. Call for contribution.

8. Developers

9. License

10. Citation

11. Acknowledgement

Owner

Benjin Zhu

Generative Adversarial Text-to-Image Synthesis

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

The Balloon Learning Environment - flying stratospheric balloons with deep reinforcement learning.

Resilience from Diversity: Population-based approach to harden models against adversarial attacks

EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction

Video Matting via Consistency-Regularized Graph Neural Networks

EgoNN: Egocentric Neural Network for Point Cloud Based 6DoF Relocalization at the City Scale

Scalable Optical Flow-based Image Montaging and Alignment

Udacity's CS101: Intro to Computer Science - Building a Search Engine

The MLOps platform for innovators 🚀

A series of Python scripts to access measurements from Fluke 28X meters. Fluke IR Remote Interface required.

Hidden-Fold Networks (HFN): Random Recurrent Residuals Using Sparse Supermasks

Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

Python library to receive live stream events like comments and gifts in realtime from TikTok LIVE.

Pytorch implementation of One-Shot Affordance Detection

Pytorch implementation of the paper "Enhancing Content Preservation in Text Style Transfer Using Reverse Attention and Conditional Layer Normalization"

Auto White-Balance Correction for Mixed-Illuminant Scenes

Consensus score for tripadvisor

PyTorch implementation of CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition

Keyword-BERT: Keyword-Attentive Deep Semantic Matching