The source code of "SIDE: Center-based Stereo 3D Detector with Structure-aware Instance Depth Estimation", accepted to WACV 2022.

Last update: Dec 18, 2022

Related tags

Deep Learning SIDE

Overview

SIDE: Center-based Stereo 3D Detector with Structure-aware Instance Depth Estimation

The source code of our work "SIDE: Center-based Stereo 3D Detector with Structure-aware Instance Depth Estimation", accepted to WACV 2022

Installation

Requirements

Python >= 3.6
PyTorch >= 1.2
COCOAPI
DCNv2

Data Preparation

KITTI

Download the train-val split of 3DOP and SubCNN and place the data as below

  ${SIDE_ROOT}
  |-- data
  `-- |-- kitti
      `-- |-- training
          |   |-- image_2
          |   |-- label_2
          |   |-- calib
          |-- ImageSets_3dop
          |   |-- test.txt
          |   |-- train.txt
          |   |-- val.txt
          |   |-- trainval.txt
          `-- ImageSets_subcnn
              |-- test.txt
              |-- train.txt
              |-- val.txt
              |-- trainval.txt

Training

To train the kitti 3D object detection with dla on 4 GPUs, run

python testTrain.py stereo --exp_id sub_dla34 --dataset kitti --kitti_split subcnn --batch_size 16 --num_epochs 70 --lr_step 45,60 --gpus 0,1,2,3

By default, pytorch evenly splits the total batch size to each GPUs. --master_batch allows using different batchsize for the master GPU, which usually costs more memory than other GPUs. If it encounters GPU memory out, using slightly less batch size with the same learning is fine.

If the training is terminated before finishing, you can use the same commond with --resume to resume training. It will found the lastest model with the same exp_id.

Evaluation

To evaluate the kitti dataset, first compile the evaluation tool (from here):

cd SIDE_ROOT/src/tools/kitti_eval
g++ -o evaluate_object_3d_offline evaluate_object_3d_offline.cpp -O3

Then run the evaluation with pretrained model:

python testVal.py stereo --exp_id sub_dla34 --dataset kitti --kitti_split 3dop --resume

to evaluate the 3DOP split. For the subcnn split, change --kitti_split to subcnn and load the corresponding models.

License

SIDE itself is released under the MIT License (refer to the LICENSE file for details). Portions of the code are borrowed from CenterNet(anchor-free design), Stereo-RCNN(geometric constraint), DCNv2(deformable convolutions) and kitti_eval (KITTI dataset evaluation). Please refer to the original License of these projects (See NOTICE).

Reference

If you find our work useful in your research, please consider citing our paper:

@article{peng2021side,
  title={SIDE: Center-based Stereo 3D Detector with Structure-aware Instance Depth Estimation},
  author={Peng, Xidong and Zhu, Xinge and Wang, Tai and Ma, Yuexin},
  journal={arXiv preprint arXiv:2108.09663},
  year={2021}
}

The source code of "SIDE: Center-based Stereo 3D Detector with Structure-aware Instance Depth Estimation", accepted to WACV 2022.

Related tags

Overview

SIDE: Center-based Stereo 3D Detector with Structure-aware Instance Depth Estimation

Installation

Requirements

Data Preparation

KITTI

Training

Evaluation

License

Reference

Owner

Live training loss plot in Jupyter Notebook for Keras, PyTorch and others

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, as a standalone package for Pytorch

Code for Boundary-Aware Segmentation Network for Mobile and Web Applications

A Collection of Papers and Codes for ICCV2021 Low Level Vision and Image Generation

A library for hidden semi-Markov models with explicit durations

U-Net: Convolutional Networks for Biomedical Image Segmentation

TDN: Temporal Difference Networks for Efficient Action Recognition

Official code for our ICCV paper: "From Continuity to Editability: Inverting GANs with Consecutive Images"

This is a JAX implementation of Neural Radiance Fields for learning purposes.

Implementation of [Time in a Box: Advancing Knowledge Graph Completion with Temporal Scopes].

A unified framework for machine learning with time series

Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI

Generating Videos with Scene Dynamics

Causal-Adversarial-Instruments - PyTorch Implementation for Developing Library of Investigating Adversarial Examples on A Causal View by Instruments

Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://arxiv.org/abs/2103.06332).

PyQt6 configuration in yaml format providing the most simple script.

Repository for the Bias Benchmark for QA dataset.

Music Generation using Neural Networks Streamlit App

The source code of "SIDE: Center-based Stereo 3D Detector with Structure-aware Instance Depth Estimation", accepted to WACV 2022.

Related tags

Overview

SIDE: Center-based Stereo 3D Detector with Structure-aware Instance Depth Estimation

Installation

Requirements

Data Preparation

KITTI

Training

Evaluation

License

Reference

Owner

Live training loss plot in Jupyter Notebook for Keras, PyTorch and others

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, as a standalone package for Pytorch

Code for Boundary-Aware Segmentation Network for Mobile and Web Applications

A Collection of Papers and Codes for ICCV2021 Low Level Vision and Image Generation

A library for hidden semi-Markov models with explicit durations

U-Net: Convolutional Networks for Biomedical Image Segmentation

TDN: Temporal Difference Networks for Efficient Action Recognition

Official code for our ICCV paper: "From Continuity to Editability: Inverting GANs with Consecutive Images"

This is a JAX implementation of Neural Radiance Fields for learning purposes.

Implementation of [Time in a Box: Advancing Knowledge Graph Completion with Temporal Scopes].

A unified framework for machine learning with time series

Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch

This is the pytorch implementation for the paper: *Learning Accurate Performance Predictors for Ultrafast Automated Model Compression*, which is in submission to TPAMI

Generating Videos with Scene Dynamics

Causal-Adversarial-Instruments - PyTorch Implementation for Developing Library of Investigating Adversarial Examples on A Causal View by Instruments

Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://arxiv.org/abs/2103.06332).

PyQt6 configuration in yaml format providing the most simple script.

Repository for the Bias Benchmark for QA dataset.

Music Generation using Neural Networks Streamlit App

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI