Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Last update: Dec 19, 2022


Acknowledgement

Our model, BtcDet, is implemented on top of [OpenPCDet 0.3.0].

Installation

Requirements

The code has been tested in the following environment:

Linux (tested on Ubuntu 14.04/16.04)
Python 3.6+
PyTorch 1.1 or higher (tested on PyTorch 1.7, 1.8.1, 1.9, 1.10)
CUDA 9.0 or higher (PyTorch 1.3+ needs CUDA 9.2+; tested on CUDA 10.2)
spconv v1.2.1 (commit fad3000249d27ca918f2655ff73c41f39b0f3127)
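
The version requirements above can be checked before building anything. The following is a minimal sketch, not part of BtcDet: the helper names `parse_version` and `check_environment` are hypothetical, and the thresholds are simply the ones listed above. It only reports problems and does not try to fix them.

```python
import sys


def parse_version(text):
    """Turn a string such as '1.8.1+cu102' into (1, 8, 1).

    Parsing stops at the first component that is not purely numeric.
    """
    parts = []
    for piece in str(text).split("+")[0].split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def check_environment():
    """Return a list of human-readable problems (empty if none were found)."""
    problems = []
    if sys.version_info < (3, 6):
        problems.append("Python 3.6+ is required")
    try:
        import torch
    except ImportError:
        problems.append("PyTorch is not installed")
        return problems

    torch_version = parse_version(torch.__version__)
    if torch_version < (1, 1):
        problems.append("PyTorch 1.1 or higher is required")

    cuda = torch.version.cuda  # None for CPU-only builds of PyTorch
    if cuda is None:
        problems.append("this PyTorch build has no CUDA support")
    else:
        # PyTorch 1.3+ needs CUDA 9.2+, older versions need CUDA 9.0+.
        minimum = (9, 2) if torch_version >= (1, 3) else (9, 0)
        if parse_version(cuda) < minimum:
            problems.append("CUDA %d.%d or higher is required" % minimum)
    return problems


if __name__ == "__main__":
    for line in check_environment() or ["environment looks compatible"]:
        print(line)
```

The check reads the CUDA version PyTorch was built against, which may differ from the toolkit installed on the system.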

Install

b. Install the dependent libraries as follows:

Install the dependent Python libraries:

pip install -r requirements.txt

Install the SparseConv library. We use the implementation from [spconv].
- If you use PyTorch 1.1, make sure you install spconv v1.0 (commit 8da6f96) instead of the latest version.
- If you use PyTorch 1.3+, you need to install spconv v1.2. Following the instructions from the author of spconv, build it from source:
```
git clone -b v1.2.1 https://github.com/traveller59/spconv.git --recursive
cd spconv
sudo apt-get install libboost-all-dev
python setup.py bdist_wheel
cd ./dist
```
Then use pip to install the generated wheel (.whl) file:
```
pip install spconv-1.2.1-{your system info}.whl
```
After that, first leave the spconv directory, then run `python -c "import spconv"` to check that it was installed correctly.
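
The reason for leaving the source directory is that Python may otherwise pick up the local `spconv` folder instead of the installed wheel. A minimal sketch of a check that also reports where the module would be loaded from is shown below; `locate_module` is a hypothetical helper, not part of spconv or BtcDet.

```python
import importlib.util
import os


def locate_module(name):
    """Return the file a module would be imported from, or None.

    None means the module cannot be found, or it has no file location
    (for example a bare directory picked up as a namespace package).
    The module itself is not imported.
    """
    spec = importlib.util.find_spec(name)
    if spec is None or not spec.has_location:
        return None
    return os.path.realpath(spec.origin)


if __name__ == "__main__":
    path = locate_module("spconv")
    if path is None:
        print("spconv is not importable from this directory")
    else:
        print("spconv would be imported from:", path)
```

If the printed path points into the cloned spconv repository rather than into `site-packages`, the import is resolving to the source tree and not to the installed wheel.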

c. Install this btcdet library by running the following command:

cd btcdet
python setup.py develop

Preparation

KITTI Dataset

Please download the official KITTI 3D object detection dataset and organize the downloaded files as follows (the road planes, which are optional and used for data augmentation during training, can be downloaded from [road plane]):

BtcDet
├── data
│   ├── kitti
│   │   ├── detection3d
│   │   │   ├── ImageSets
│   │   │   ├── training
│   │   │   │   ├── calib & velodyne & label_2 & image_2 & (optional: planes) & (optional: depth_2)
│   │   │   ├── testing
│   │   │   │   ├── calib & velodyne & image_2
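
Before generating the data infos, it can help to confirm that the folders are where the tree above expects them. This is a minimal sketch under the assumption that `ImageSets`, `training`, and `testing` sit directly under `data/kitti/detection3d`; the function name `missing_kitti_dirs` is hypothetical, and the optional `planes` and `depth_2` folders are not checked.

```python
from pathlib import Path

# Required sub-folders per split, taken from the directory tree above.
REQUIRED_SUBDIRS = {
    "training": ["calib", "velodyne", "label_2", "image_2"],
    "testing": ["calib", "velodyne", "image_2"],
}


def missing_kitti_dirs(repo_root):
    """Return the expected KITTI directories that do not exist yet."""
    root = Path(repo_root)
    base = root / "data" / "kitti" / "detection3d"
    expected = [base / "ImageSets"]
    for split, names in REQUIRED_SUBDIRS.items():
        expected.extend(base / split / name for name in names)
    # Report paths relative to the repository root for readability.
    return [str(path.relative_to(root)) for path in expected if not path.is_dir()]


if __name__ == "__main__":
    for path in missing_kitti_dirs("."):
        print("missing:", path)
```

Run from the repository root, the script prints nothing when every required directory is present.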

Generate the data infos by running the following command:

python -m btcdet.datasets.kitti.kitti_dataset create_kitti_infos tools/cfgs/dataset_configs/kitti_dataset.yaml

Generate the approximated complete object points:

(From the btcdet root directory, run:)

python -m btcdet.datasets.multifindbestfit

Alternatively, you can use our pre-generated KITTI data, which includes the generated complete object points: download it [here (about 31 GB)], put the zip file inside data/kitti/, and unzip it as the detection3d directory.

Run training:

cd tools/

Single-GPU training

mkdir -p ../output/kitti_car

python train.py --cfg_file ./cfgs/model_configs/btcdet_kitti_car.yaml --output_dir ../output/kitti_car/ --batch_size 2
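
The single-GPU steps can also be scripted. The sketch below only creates the output directory and assembles the command shown above; `build_train_command` is a hypothetical helper, and it uses no flags beyond `--cfg_file`, `--output_dir`, and `--batch_size`. It assumes it is run from the tools/ directory.

```python
import sys
from pathlib import Path


def build_train_command(cfg_file, output_dir, batch_size):
    """Create output_dir if needed and return the train.py command as a list."""
    Path(output_dir).mkdir(parents=True, exist_ok=True)
    return [
        sys.executable, "train.py",
        "--cfg_file", str(cfg_file),
        "--output_dir", str(output_dir),
        "--batch_size", str(batch_size),
    ]


# Example (not executed here): launch training from tools/ with
#   import subprocess
#   cmd = build_train_command(
#       "./cfgs/model_configs/btcdet_kitti_car.yaml", "../output/kitti_car/", 2)
#   subprocess.run(cmd, check=True)
```

Building the command as a list avoids shell quoting issues and makes it easy to vary the batch size or config file between runs.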

Multi-GPU training

bash scripts/dist_train.sh 4 --batch_size 8 --gpu_str "0,1,2,3" --cfg_file ./cfgs/model_configs/btcdet_kitti_car.yaml --output_dir ../output/kitti_car/


Owner

Qiangeng Xu
