Awesome Monocular 3D detection

Overview

Awesome Monocular 3D detection

Paper list of 3D detetction, keep updating!

Contents

Paper List

2022

  • [MonoDistill] MonoDistill: Learning Spatial Features for Monocular 3D Object Detection [ICLR2022][Pytorch]
  • [MonoCon] Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection [AAAI2022][Pytorch]
  • [ImVoxelNet] ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection [WACV2022][Pytorch]

2021

  • [PCT] Progressive Coordinate Transforms for Monocular 3D Object Detection [NeurIPS2021][Pytorch]
  • [DFR-Net] The Devil Is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection [ICCV2021]
  • [AutoShape] AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection [ICCV2021][Pytorch][Paddle]
  • [pseudo-analysis] Are we Missing Confidence in Pseudo-LiDAR Methods for Monocular 3D Object Detection? [ICCV2021]
  • [Gated3D] Gated3D: Monocular 3D Object Detection From Temporal Illumination Cues [ICCV2021]
  • [MonoRCNN] Geometry-based Distance Decomposition for Monocular 3D Object Detection [ICCV2021][Pytorch]
  • [DD3D] Is Pseudo-Lidar needed for Monocular 3D Object detection [ICCV2021][Pytorch]
  • [GUPNet] Geometry Uncertainty Projection Network for Monocular 3D Object Detection [ICCV2021][Pytorch]
  • [Neighbor-Vote] Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting [ACMMM2021]
  • [MonoEF] Monocular 3D Object Detection: An Extrinsic Parameter Free Approach [CVPR2021][Pytorch]
  • [monodle] Delving into Localization Errors for Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [Monoflex] Objects are Different: Flexible Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [GrooMeD-NMS] GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [DDMP-3D] Depth-conditioned Dynamic Message Propagation for Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [MonoRUn] MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation [CVPR2021][Pytorch]
  • [M3DSSD] M3DSSD: Monocular 3D Single Stage Object Detector [CVPR2021][Pytorch]
  • [CaDDN] Categorical Depth Distribution Network for Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [visualDet3D] Ground-aware Monocular 3D Object Detection for Autonomous Driving [RA-L][Pytorch]

2020

  • [UR3D] Distance-Normalized Unified Representation for Monocular 3D Object Detection [ECCV2020]
  • [MonoDR] Monocular Differentiable Rendering for Self-Supervised 3D Object Detection [ECCV2020]
  • [DA-3Ddet] Monocular 3d object detection via feature domain adaptation [ECCV2020]
  • [MoVi-3D] Towards generalization across depth for monocular 3d object detection [ECCV2020]
  • [PatchNet] Rethinking Pseudo-LiDAR Representation [ECCV2020][Pytorch]
  • [RAR-Net] Reinforced Axial Refinement Network for Monocular 3D Object Detection [ECCV2020]
  • [kinematic3d] Kinematic 3D Object Detection in Monocular Video [ECCV2020][Pytorch]
  • [RTM3D] RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving [ECCV2020][Pytorch]
  • [SMOKE] SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation [CVPRW2020][Pytorch]
  • [D4LCN] Learning Depth-Guided Convolutions for Monocular 3D Object Detection [CVPRW2020][Pytorch]
  • [MonoPair] MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships [CVPR2020]
  • [pseudo-LiDAR_e2e] End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection [CVPR2020][Pytorch]
  • [Pseudo-LiDAR++] Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving [ICLR2020][Pytorch]
  • [OACV] Object-Aware Centroid Voting for Monocular 3D Object Detection [IROS2020]
  • [MonoGRNet_v2] Monocular 3D Object Detection via Geometric Reasoning on Keypoints [VISIGRAPP2020]
  • [ForeSeE] Task-Aware Monocular Depth Estimation for 3D Object Detection [AAAI2020(oral)][Pytorch]
  • [Decoupled-3D] Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation [AAAI2020]

2019

  • [3d-vehicle-tracking] Joint Monocular 3D Vehicle Detection and Tracking [ICCV2019][Pytorch]
  • [MonoDIS] Disentangling monocular 3d object detection [ICCV2019]
  • [AM3D] Accurate Monocular Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving [ICCV2019]
  • [M3D-RPN] M3D-RPN: Monocular 3D Region Proposal Network for Object Detection [ICCV2019(Oral)][Pytorch]
  • [MVRA] Multi-View Reprojection Architecture for Orientation Estimation [ICCVW2019]
  • [Mono3DPLiDAR] Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud [ICCVW2019]
  • [MonoPSR] Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction [CVPR2019][Pytorch]
  • [FQNet] Deep fitting degree scoring network for monocular 3d object detection [CVPR2019]
  • [ROI-10D] ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape [CVPR2019]
  • [GS3D] GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving [CVPR2019]
  • [Pseudo-LiDAR] Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving [CVPR2019][Pytorch]
  • [BirdGAN] Learning 2D to 3D Lifting for Object Detection in 3D for Autonomous Vehicles [IROS2019]
  • [MonoGRNet] MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization [AAAI2019(oral)][Tensorflow]
  • [OFT-Net] Orthographic feature transform for monocular 3d object detection [BMVC2019][Pytorch]
  • [Shift R-CNN] Shift R-CNN: Deep Monocular 3D Object Detection with Closed-Form Geometric Constraints [TIP2019]
  • [SS3D] SS3D: Monocular 3d object detection and box fitting trained end-to-end using intersection-over-union loss [Arxiv2019]

2018

  • [Multi-Fusion] Multi-Level Fusion based 3D Object Detection from Monocular Images [CVPR2018][Pytorch]
  • [Mono3D++] Mono3D++: Monocular 3D Vehicle Detection with Two-Scale 3D Hypotheses and Task Priors [AAAI2018]

2017

  • [Deep3DBox] 3D Bounding Box Estimation Using Deep Learning and Geometry [CVPR2017][Pytorch][Tensorflow]
  • [Deep MANTA] Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis from monocular image [CVPR2017]

2016

  • [Mono3D] Monocular 3D object detection for autonomous driving [CVPR2016]

KITTI Results

Method Extra Test, AP3D|R40 Val, AP3D|R40 Val, AP3D|R11 Reference
Easy Mod. Hard Easy Mod. Hard Easy Mod. Hard
MonoRUn Lidar 19.65 12.30 10.58 20.02 14.65 12.61 - - - CVPR2021
CaDDN Lidar 19.17 13.41 11.46 23.57 16.31 13.84 - - - CVPR2021
AM3D Depth 16.50 10.74 9.52 28.31 15.76 12.24 32.23 21.09 17.26 ICCV2019
PatchNet Depth 15.68 11.12 10.17 31.60 16.80 13.80 35.10 22.00 19.60 ECCV2020
D4LCN Depth 16.65 11.72 9.51 22.32 16.20 12.30 26.97 21.72 18.22 CVPRW2020
DFR-Net Depth 19.40 13.63 10.35 24.81 17.78 14.41 28.80 22.88 19.47 ICCV2021
M3D-RPN None 14.76 9.71 7.42 14.53 11.07 8.65 20.27 17.06 15.21 ICCV2019
SMOKE None 14.03 9.76 7.84 - - - 14.76 12.85 11.50 CVPRW2020
MonoPair None 13.04 9.99 8.65 16.28 12.30 10.42 - - - CVPR2020
RTM3D None 14.41 10.34 8.77 - - - 20.77 16.86 16.63 ECCV2020
M3DSSD None 17.51 11.46 8.98 - - - 27.77 21.67 18.28 CVPR2021
Monoflex None 19.94 13.89 12.07 23.64 17.51 14.83 28.17 21.92 19.07 CVPR2021
GUPNet None 20.11 14.20 11.77 22.76 16.46 13.72 - - - ICCV2021
MonoCon None 22.50 16.46 13.95 26.33 19.01 15.98 - - - AAAI2022
Owner
Zhikang Zou
Baidu Inc.
Zhikang Zou
A python toolbox for predictive uncertainty quantification, calibration, metrics, and visualization

Website, Tutorials, and Docs    Uncertainty Toolbox A python toolbox for predictive uncertainty quantification, calibration, metrics, and visualizatio

Uncertainty Toolbox 1.4k Dec 28, 2022
Neon-erc20-example - Example of creating SPL token and wrapping it with ERC20 interface in Neon EVM

Example of wrapping SPL token by ERC2-20 interface in Neon Requirements Install

7 Mar 28, 2022
This repo is customed for VisDrone.

Object Detection for VisDrone(无人机航拍图像目标检测) My environment 1、Windows10 (Linux available) 2、tensorflow = 1.12.0 3、python3.6 (anaconda) 4、cv2 5、ensemble

53 Jul 17, 2022
This code finds bounding box of a single human mouth.

This code finds bounding box of a single human mouth. In comparison to other face segmentation methods, it is relatively insusceptible to open mouth conditions, e.g., yawning, surgical robots, etc. T

iThermAI 4 Nov 27, 2022
General-purpose program synthesiser

DeepSynth General-purpose program synthesiser. This is the repository for the code of the paper "Scaling Neural Program Synthesis with Distribution-ba

Nathanaël Fijalkow 24 Oct 23, 2022
Repository for GNSS-based position estimation using a Deep Neural Network

Code repository accompanying our work on 'Improving GNSS Positioning using Neural Network-based Corrections'. In this paper, we present a Deep Neural

32 Dec 13, 2022
MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction This is the official implementation for the ICCV 2021 paper Learning Sign

110 Dec 20, 2022
Authors implementation of LieTransformer: Equivariant Self-Attention for Lie Groups

LieTransformer This repository contains the implementation of the LieTransformer used for experiments in the paper LieTransformer: Equivariant self-at

35 Oct 18, 2022
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language This repository contains UA-GEC data and an accompanying Python lib

Grammarly 226 Dec 29, 2022
Official PyTorch implementation of "ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows"

ArtFlow Official PyTorch implementation of the paper: ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows Jie An*, Siyu Huang*, Yibing

123 Dec 27, 2022
Graph Transformer Architecture. Source code for

Graph Transformer Architecture Source code for the paper "A Generalization of Transformer Networks to Graphs" by Vijay Prakash Dwivedi and Xavier Bres

NTU Graph Deep Learning Lab 561 Jan 08, 2023
An open-source Deep Learning Engine for Healthcare that aims to treat & prevent major diseases

AlphaCare Background AlphaCare is a work-in-progress, open-source Deep Learning Engine for Healthcare that aims to treat and prevent major diseases. T

Siraj Raval 44 Nov 05, 2022
LRBoost is a scikit-learn compatible approach to performing linear residual based stacking/boosting.

LRBoost is a sckit-learn compatible package for linear residual boosting. LRBoost combines a linear estimator and a non-linear estimator to leverage t

Andrew Patton 5 Nov 23, 2022
"SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image", Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Humphrey Shi, Zhangyang Wang

SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image [Paper] [Website] Pipeline Code Environment pip install -r requirements

VITA 250 Jan 05, 2023
This repository is based on Ultralytics/yolov5, with adjustments to enable rotate prediction boxes.

Rotate-Yolov5 This repository is based on Ultralytics/yolov5, with adjustments to enable rotate prediction boxes. Section I. Description The codes are

xinzelee 90 Dec 13, 2022
Code for "Solving Graph-based Public Good Games with Tree Search and Imitation Learning"

Code for "Solving Graph-based Public Good Games with Tree Search and Imitation Learning" This is the code for the paper Solving Graph-based Public Goo

Victor-Alexandru Darvariu 3 Dec 05, 2022
A best practice for tensorflow project template architecture.

A best practice for tensorflow project template architecture.

Mahmoud Gamal Salem 3.6k Dec 22, 2022
[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

Discriminative Region-based Multi-Label Zero-Shot Learning (ICCV 2021) [arXiv][Project page coming soon] Sanath Narayan*, Akshita Gupta*, Salman Kh

Akshita Gupta 54 Nov 21, 2022
PyTorch implementation for Convolutional Networks with Adaptive Inference Graphs

Convolutional Networks with Adaptive Inference Graphs (ConvNet-AIG) This repository contains a PyTorch implementation of the paper Convolutional Netwo

Andreas Veit 176 Dec 07, 2022