Code for LIGA-Stereo Detector, ICCV'21

Last update: Dec 09, 2022

Related tags

Overview

LIGA-Stereo

Introduction

This is the official implementation of the paper LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector, In ICCV'21, Xiaoyang Guo, Shaoshuai Shi, Xiaogang Wang and Hongsheng Li.

[project page] [paper] [code]

Installation

Requirements

All the codes are tested in the following environment:

Linux (tested on Ubuntu 14.04 / 16.04)
Python 3.7
PyTorch 1.6.0
Torchvision 0.7.0
CUDA 9.2 / 10.1
spconv (commit f22dd9)

Installation Steps

a. Clone this repository.

git clone https://github.com/xy-guo/LIGA.git

b. Install the dependent libraries as follows:

Install the dependent python libraries:

pip install -r requirements.txt

Install the SparseConv library, we use the implementation from [spconv].

git clone https://github.com/traveller59/spconv
git reset --hard f22dd9
git submodule update --recursive
python setup.py bdist_wheel
pip install ./dist/spconv-1.2.1-cp37-cp37m-linux_x86_64.whl

Install modified mmdetection from [mmdetection_kitti]

git clone https://github.com/xy-guo/mmdetection_kitti
python setup.py develop

c. Install this library by running the following command:

python setup.py develop

Getting Started

The dataset configs are located within configs/stereo/dataset_configs, and the model configs are located within configs/stereo for different datasets.

Dataset Preparation

Currently we only provide the dataloader of KITTI dataset.

Please download the official KITTI 3D object detection dataset and organize the downloaded files as follows (the road planes are provided by OpenPCDet [road plane], which are optional for training LiDAR models):

LIGA_PATH
├── data
│   ├── kitti
│   │   │── ImageSets
│   │   │── training
│   │   │   ├──calib & velodyne & label_2 & image_2 & (optional: planes)
│   │   │── testing
│   │   │   ├──calib & velodyne & image_2
├── configs
├── liga
├── tools

You can also choose to link your KITTI dataset path by

YOUR_KITTI_DATA_PATH=~/data/kitti_object
ln -s $YOUR_KITTI_DATA_PATH/training/ ./data/kitti/
ln -s $YOUR_KITTI_DATA_PATH/testing/ ./data/kitti/

Generate the data infos by running the following command:

python -m liga.datasets.kitti.kitti_dataset create_kitti_infos
python -m liga.datasets.kitti.kitti_dataset create_gt_database_only

Training & Testing

Test and evaluate the pretrained models

To test with multiple GPUs:

./scripts/dist_test_ckpt.sh ${NUM_GPUS} ./configs/stereo/kitti_models/liga.yaml ./ckpt/pretrained_liga.pth

Train a model

Train with multiple GPUs

./scripts/dist_train.sh ${NUM_GPUS} 'exp_name' ./configs/stereo/kitti_models/liga.yaml

Pretrained Models

Google Drive

Citation

@InProceedings{Guo_2021_ICCV,
    author = {Guo, Xiaoyang and Shi, Shaoshuai and Wang, Xiaogang and Li, Hongsheng},
    title = {LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month = {October},
    year = {2021}
}

Acknowledgements

Part of codes are migrated from OpenPCDet and DSGN.

Code for LIGA-Stereo Detector, ICCV'21

Related tags

Overview

LIGA-Stereo

Introduction

Overview

Installation

Requirements

Installation Steps

Getting Started

Dataset Preparation

Training & Testing

Test and evaluate the pretrained models

Train a model

Pretrained Models

Citation

Acknowledgements

Owner

Xiaoyang Guo

[BMVC 2021] Official PyTorch Implementation of Self-supervised learning of Image Scale and Orientation Estimation

PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT.

Our solution for SSN Invente 2021's Hackathon

Paddle implementation for "Cross-Lingual Word Embedding Refinement by ℓ1 Norm Optimisation" (NAACL 2021)

A general framework for deep learning experiments under PyTorch based on pytorch-lightning

A Jupyter notebook to play with NVIDIA's StyleGAN3 and OpenAI's CLIP for a text-based guided image generation.

A crossplatform menu bar application using mpv as DLNA Media Renderer.

A small fun project using python OpenCV, mediapipe, and pydirectinput

This repository contains tutorials for the py4DSTEM Python package

Pytorch implementation of various High Dynamic Range (HDR) Imaging algorithms

Repository for MeshTalk supplemental material and code once the (already approved) 16 GHS captures our lab will make publicly available are released.

Simple, efficient and flexible vision toolbox for mxnet framework.

Create time-series datacubes for supervised machine learning with ICEYE SAR images.

Source code, datasets and trained models for the paper Learning Advanced Mathematical Computations from Examples (ICLR 2021), by François Charton, Amaury Hayat (ENPC-Rutgers) and Guillaume Lample

COCO Style Dataset Generator GUI

FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks

Interpretation of T cell states using reference single-cell atlases

Code for the paper Open Sesame: Getting Inside BERT's Linguistic Knowledge.

SiamMOT is a region-based Siamese Multi-Object Tracking network that detects and associates object instances simultaneously.

A Robust Unsupervised Ensemble of Feature-Based Explanations using Restricted Boltzmann Machines