PointPillars inference with TensorRT

Last update: Dec 31, 2022

This repository contains the source code and model for PointPillars inference using TensorRT. The model is created with OpenPCDet and modified with onnx_graphsurgeon.

Inference has four stages:
generateVoxels: converts the point cloud into voxels with 4 channels
generateFeatures: converts the voxels into feature maps with 10 channels
Inference: converts the feature maps into raw bounding box, class score, and direction data
Postprocessing: parses the bounding boxes, class scores, and directions
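The step from 4 to 10 channels can be sketched for a single pillar as follows. This README does not list the 10 channels, so the layout below is an assumption based on the commonly used pillar encoding: the 4 raw values (x, y, z, intensity), the offset of each point from the mean of the points in its pillar, and the offset from the pillar's geometric center. The function name `decorate_pillar` and the sample values are hypothetical, and the sketch is written in Python for brevity rather than taken from the demo.

```python
from statistics import fmean


def decorate_pillar(points, pillar_center):
    """Expand the 4-channel points of one pillar into 10-channel features.

    points: list of (x, y, z, intensity) tuples that fall into one pillar.
    pillar_center: (cx, cy, cz) geometric center of that pillar (assumed known).
    """
    # Mean position of the points inside this pillar.
    mean_x = fmean(p[0] for p in points)
    mean_y = fmean(p[1] for p in points)
    mean_z = fmean(p[2] for p in points)
    cx, cy, cz = pillar_center

    features = []
    for x, y, z, intensity in points:
        features.append((
            x, y, z, intensity,                  # 4 raw channels
            x - mean_x, y - mean_y, z - mean_z,  # offset from the point mean
            x - cx, y - cy, z - cz,              # offset from the pillar center
        ))
    return features


# Hypothetical pillar holding two points.
pillar = [(1.0, 2.0, 0.5, 0.3), (1.2, 2.2, 0.7, 0.5)]
feats = decorate_pillar(pillar, pillar_center=(1.08, 2.08, -1.0))
print(len(feats[0]))  # number of channels per point
```

In the demo this work runs on the GPU for all pillars at once; the loop above only shows what one pillar's features might contain under the stated assumption.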

Data

The demo uses data from the KITTI dataset. More data can be downloaded by following the GETTING_STARTED link.

Model

The ONNX file can be converted from a model trained with OpenPCDet, using the tool in the demo.

Build

Prerequisites

Building the PointPillars inference demo requires CUDA and TensorRT with a PillarScatter layer. The PillarScatter layer is already implemented as a TensorRT plugin in the demo.

JetPack 4.5
TensorRT 7.1.3
CUDA 10.2 + cuDNN 8.0.0
PCL (optional), for storing point clouds as pcd files

Compile

$ cd test
$ mkdir build
$ cd build
$ cmake ..
$ make -j$(nproc)

Run

$ ./demo

Environments

JetPack 4.5
CUDA 10.2 + cuDNN 8.0.0 + TensorRT 7.1.3
Nvidia Jetson AGX Xavier

Performance

FP16

| Stage             | GPU time (ms) |
| ----------------- | ------------- |
| generateVoxels    | 0.22          |
| generateFeatures  | 0.21          |
| Inference         | 30.75         |
| Postprocessing    | 3.19          |
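
The per-stage timings can be combined into a rough end-to-end figure. The short sketch below assumes the four stages run one after another with no overlap, which this README does not state, so the total and frame rate are estimates rather than measured values.

```python
# Per-stage GPU times in milliseconds, copied from the FP16 table.
stage_ms = {
    "generateVoxels": 0.22,
    "generateFeatures": 0.21,
    "Inference": 30.75,
    "Postprocessing": 3.19,
}

# Assumption: stages execute sequentially, so their times simply add up.
total_ms = sum(stage_ms.values())
fps = 1000.0 / total_ms

print(f"total: {total_ms:.2f} ms per frame, about {fps:.1f} frames/s")
for name, ms in stage_ms.items():
    print(f"{name}: {100.0 * ms / total_ms:.1f}% of total")
```

Under that assumption the network inference stage accounts for most of the frame time, so the voxel and feature stages are unlikely to be the first place to look for speedups.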

Note

The GPU processes all points at the same time, and the points kept for each voxel are selected from the point cloud randomly, so the output of generateVoxels on the GPU varies between runs. The CPU always selects the first 32 points of a voxel, so the output of generateVoxels on the CPU is fixed.
The demo caches the ONNX file to improve performance. To use a new ONNX file, first remove the cache file in "./model".
MAX_VOXELS in params.h sets the size of the cache allocated during inference. Decrease the value to save memory.
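
The first and third notes can be illustrated with a small CPU-style sketch in Python. It keeps the first 32 points that arrive in each voxel, so the result depends only on the input order; feeding the same points in a different order, which is roughly what parallel GPU threads amount to, keeps a different subset. The voxel size, the grid origin, the MAX_VOXELS value of 40000, and the buffer layout used for the memory estimate are all hypothetical; the real values and allocations are in params.h and the demo sources.

```python
import random

MAX_POINTS_PER_VOXEL = 32  # "first 32 points", as stated in the note above
MAX_VOXELS = 40000         # hypothetical value; the real one is in params.h
CHANNELS = 4               # x, y, z, intensity


def generate_voxels_cpu(points, voxel_size=0.16, origin=(0.0, 0.0)):
    """Group points into voxels, keeping the first 32 points of each voxel."""
    voxels = {}
    for x, y, z, intensity in points:
        key = (int((x - origin[0]) // voxel_size),
               int((y - origin[1]) // voxel_size))
        bucket = voxels.get(key)
        if bucket is None:
            if len(voxels) >= MAX_VOXELS:
                continue  # voxel budget exhausted: drop the point
            bucket = voxels[key] = []
        if len(bucket) < MAX_POINTS_PER_VOXEL:
            bucket.append((x, y, z, intensity))
    return voxels


# 100 hypothetical points that all fall into voxel (0, 0).
points = [(0.05, 0.05, 0.01 * i, 0.5) for i in range(100)]
in_order = generate_voxels_cpu(points)

# Same points, different arrival order: a different subset survives.
shuffled = points[:]
random.Random(0).shuffle(shuffled)
reordered = generate_voxels_cpu(shuffled)

print(in_order[(0, 0)] == points[:32])         # fixed result for a fixed order
print(reordered[(0, 0)] == in_order[(0, 0)])   # order changes the kept points

# Rough memory estimate, assuming one float32 buffer of
# MAX_VOXELS x 32 points x 4 channels.
buffer_bytes = MAX_VOXELS * MAX_POINTS_PER_VOXEL * CHANNELS * 4
print(f"voxel buffer: {buffer_bytes / 2**20:.2f} MiB")
```

Because the estimate scales linearly with MAX_VOXELS, halving that constant would halve such a buffer, at the cost of dropping points once the voxel budget is used up.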

Owner

NVIDIA AI IOT
