MonoRCNN is a monocular 3D object detection method for automonous driving

Last update: Dec 27, 2022

Related tags

Overview

MonoRCNN

MonoRCNN is a monocular 3D object detection method for automonous driving, published at ICCV 2021. This project is an implementation of MonoRCNN.

Visualization

Methodology

Installation

Python 3.6
PyTorch 1.5.0
Detectron2 0.1.3

Please use the Detectron2 included in this project. To ignore fully occluded objects during training, build.py, rpn.py, and roi_heads.py have been modified.

Dataset Preparation

KITTI

Model & Log

KITTI val1 split

Organize the downloaded files as follows:

├── projects
│   ├── MonoRCNN
│   │   ├── output
│   │   │   ├── model
│   │   │   ├── log.txt
│   │   │   ├── ...

Test

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1 --resume --eval-only

Set VISUALIZE as True to visualize 3D object detection results (saved in output/evaluation/test/visualization).

Training

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1

Citation

If you find this project useful in your research, please cite:

@inproceedings{MonoRCNN_ICCV21,
    title = {Geometry-based Distance Decomposition for Monocular 3D Object Detection},
    author = {Xuepeng Shi and Qi Ye and 
              Xiaozhi Chen and Chuangrong Chen and 
              Zhixiang Chen and Tae-Kyun Kim},
    booktitle = {ICCV},
    year = {2021},
}

Contact

[email protected]

MonoRCNN is a monocular 3D object detection method for automonous driving

Related tags

Overview

MonoRCNN

Visualization

Methodology

Related Link

Installation

Dataset Preparation

Model & Log

Test

Training

Citation

Contact

Acknowledgement

Owner

Code to compute permutation and drop-column importances in Python scikit-learn models

A Multi-modal Perception Tracker (MPT) for speaker tracking using both audio and visual modalities

tf2-keras implement yolov5

AutoDeeplab / auto-deeplab / AutoML for semantic segmentation, implemented in Pytorch

A machine learning benchmark of in-the-wild distribution shifts, with data loaders, evaluators, and default models.

Automatically measure the facial Width-To-Height ratio and get facial analysis results provided by Microsoft Azure

YOLOV4运行在嵌入式设备上

Pytorch implementation of RED-SDS (NeurIPS 2021).

This is the code for "HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields".

Repositorio oficial del curso IIC2233 Programación Avanzada 🚀✨

Some experiments with tennis player aging curves using Hilbert space GPs in PyMC. Only experimental for now.

A MNIST-like fashion product database. Benchmark

Bridging the Gap between Label- and Reference based Synthesis(ICCV 2021)

A lightweight tool to get an AI Infrastructure Stack up in minutes not days.

Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition

[ICCV2021] Official Pytorch implementation for SDGZSL (Semantics Disentangling for Generalized Zero-Shot Learning)

The Empirical Investigation of Representation Learning for Imitation (EIRLI)

code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022

Temporal Dynamic Convolutional Neural Network for Text-Independent Speaker Verification and Phonemetic Analysis

A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).