Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Last update: Dec 19, 2022

Related tags

Deep Learning BtcDet

Overview

Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Acknowledgement

We implement our model, BtcDet, based on [OpenPcdet 0.3.0].

Installation

Requirements

All the codes are tested in the following environment:

Linux (tested on Ubuntu 14.04/16.04)
Python 3.6+
PyTorch 1.1 or higher (tested on PyTorch 1.7, 1.8.1, 1.9, 1.10)
CUDA 9.0 or higher (PyTorch 1.3+ needs CUDA 9.2+, test on CUDA 10.2)
spconv v1.2.1 (commit fad3000249d27ca918f2655ff73c41f39b0f3127)

Install

b. Install the dependent libraries as follows:

Install the dependent python libraries:

pip install -r requirements.txt

Install the SparseConv library, we use the implementation from [spconv].
- If you use PyTorch 1.1, then make sure you install the spconv v1.0 with (commit 8da6f96) instead of the latest one.
- If you use PyTorch 1.3+, then you need to install the spconv v1.2. As mentioned by the author of spconv
```
git clone -b v1.2.1  https://github.com/traveller59/spconv.git --recursive

cd spconv

sudo apt-get install libboost-all-dev

python setup.py bdist_wheel

cd ./dist 
```
then use pip to install generated whl file.
```
pip install spconv-1.2.1-{your system info}.whl
```
After that, you should first get out of the spconv directory, then do python import spconv to see if you installed it correctly.

c. Install this btcdet library by running the following command:

cd btcdet
python setup.py develop

Preparation

KITTI Dataset

Please download the official KITTI 3D object detection dataset and organize the downloaded files as follows (the road planes could be downloaded from [road plane], which are optional for data augmentation in the training):

BtcDet
├── data
│   ├── kitti
    │   │   │──detection3d  │── ImageSets
                    │   │   │── training
                    │   │   │   ├──calib & velodyne & label_2 & image_2 & (optional: planes) & (optional: depth_2)
                    │   │   │── testing
                    │   │   │   ├──calib & velodyne & image_2

Generate the data infos by running the following command:

python -m btcdet.datasets.kitti.kitti_dataset create_kitti_infos tools/cfgs/dataset_configs/kitti_dataset.yaml

Generate Approximated complete object points:

(at btcdet directory, execute:)

python -m btcdet.datasets.multifindbestfit

Alternatively,

you can use our generated kitti's data including the generated complete object points, download it [here (about 31GBs)] and put the zip file inside data/kitti/ and unzip it as detection3d directory.

Run training:

cd tools/

Single gpu training

mkdir output

mkdir output/kitti_car

python train.py --cfg_file ./cfgs/model_configs/btcdet_kitti_car.yaml --output_dir ../output/kitti_car/ --batch_size 2

Multi gpu training

bash scripts/dist_train.sh 4  --batch_size 8 --gpu_str "0,1,2,3" --cfg_file ./cfgs/model_configs/btcdet_kitti_car.yaml --output_dir ../output/kitti_car/

Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Related tags

Overview

Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Acknowledgement

Installation

Requirements

Install

Preparation

KITTI Dataset

Generate Approximated complete object points:

Alternatively,

Run training:

Owner

Qiangeng Xu

Structured Edge Detection Toolbox

A C implementation for creating 2D voronoi diagrams

AI pipelines for Nvidia Jetson Platform

Pytorch implementation of Nueral Style transfer

Convert scikit-learn models to PyTorch modules

Official repository for Few-shot Image Generation via Cross-domain Correspondence (CVPR '21)

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

A Fast Knowledge Distillation Framework for Visual Recognition

LoFTR:Detector-Free Local Feature Matching with Transformers CVPR 2021

Generating Digital Painting Lighting Effects via RGB-space Geometry (SIGGRAPH2020/TOG2020)

EdMIPS: Rethinking Differentiable Search for Mixed-Precision Neural Networks

Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"

Hierarchical User Intent Graph Network for Multimedia Recommendation

A repository for the paper "Improved Adversarial Systems for 3D Object Generation and Reconstruction".

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations

Code for Contrastive-Geometry Networks for Generalized 3D Pose Transfer

WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

Amazing-Python-Scripts - 🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.

Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"