Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection

Last update: Jan 07, 2023

Related tags

Deep Learning Pyramid-RCNN

Overview

Pyramid R-CNN

This is a reproduced repo of Pyramid R-CNN for 3D object detection.

The code is mainly based on OpenPCDet.

Introduction

We provide code and training configurations of Pyramid-V/PV on the KITTI and Waymo Open dataset. Checkpoints will not be released. The dataset organization is same with PCDet.

Requirements

The codes are tested in the following environment:

Ubuntu 18.04
Python 3.6
PyTorch 1.5
CUDA 10.1
OpenPCDet v0.3.0
spconv v1.2.1

Installation

a. Clone this repository.

git clone https://github.com/PointsCoder/Pyramid_R-CNN.git

b. Install the dependent libraries as follows:

Install the dependent python libraries:

pip install -r requirements.txt

Install the SparseConv library, we use the implementation from [spconv].
- If you use PyTorch 1.1, then make sure you install the spconv v1.0 with (commit 8da6f96) instead of the latest one.
- If you use PyTorch 1.3+, then you need to install the spconv v1.2. As mentioned by the author of spconv, you need to use their docker if you use PyTorch 1.4+.

c. Compile CUDA operators by running the following command:

python setup.py develop

Training

We train all the models with 8 Tesla V100 GPU (32Gb), and all the configs (epochs/learning rate/batch size) are for 8-GPU Distributed Data Parallel (DDP) training. Users may change those training parameters if they want to run with different GPU numbers and memories.

models

# pyramid_rcnn_pv.yaml: pyramid roi head on the point-voxel backbone
# pyramid_rcnn_v.yaml: pyramid roi head on the spconv u-net backbone

DDP training

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 sh scripts/dist_train.sh 8 --cfg_file cfgs/waymo_models/pyramid_rcnn_pv.yaml

DDP testing

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 sh scripts/dist_test.sh 8 --cfg_file cfgs/waymo_models/pyramid_rcnn_pv.yaml --eval_all

Citation

If you find this project useful in your research, please consider cite:

@article{mao2021pyramid,
  title={Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection},
  author={Mao, Jiageng and Niu, Minzhe and Bai, Haoyue and Liang, Xiaodan and Xu, Hang and Xu, Chunjing},
  journal={ICCV},
  year={2021}
}

Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection

Related tags

Overview

Pyramid R-CNN

Introduction

Requirements

Installation

Training

Citation

Owner

Danfeng Hong, Lianru Gao, Jing Yao, Bing Zhang, Antonio Plaza, Jocelyn Chanussot. Graph Convolutional Networks for Hyperspectral Image Classification, IEEE TGRS, 2021.

Next-Best-View Estimation based on Deep Reinforcement Learning for Active Object Classification

Deep Learning Slide Captcha

Official repository for ABC-GAN

Bridging Vision and Language Model

A Pytorch implementation of MoveNet from Google. Include training code and pre-train model.

MARS: Learning Modality-Agnostic Representation for Scalable Cross-media Retrieva

a minimal terminal with python 😎😉

UmlsBERT: Clinical Domain Knowledge Augmentation of Contextual Embeddings Using the Unified Medical Language System Metathesaurus

UNAVOIDS: Unsupervised and Nonparametric Approach for Visualizing Outliers and Invariant Detection Scoring

Official pytorch code for SSAT: A Symmetric Semantic-Aware Transformer Network for Makeup Transfer and Removal

Official Repsoitory for "Activate or Not: Learning Customized Activation." [CVPR 2021]

[CVPR'21] Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild

A pytorch &keras implementation and demo of Fastformer.

Code for testing various M1 Chip benchmarks with TensorFlow.

Projects for AI/ML and IoT integration for games and other presented at re:Invent 2021.

Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.

Code for IntraQ, PyTorch implementation of our paper under review

Deep universal probabilistic programming with Python and PyTorch

Official implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21).