Awesome Monocular 3D detection

Overview

Awesome Monocular 3D detection

Paper list of 3D detetction, keep updating!

Contents

Paper List

2022

  • [MonoDistill] MonoDistill: Learning Spatial Features for Monocular 3D Object Detection [ICLR2022][Pytorch]
  • [MonoCon] Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection [AAAI2022][Pytorch]
  • [ImVoxelNet] ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection [WACV2022][Pytorch]

2021

  • [PCT] Progressive Coordinate Transforms for Monocular 3D Object Detection [NeurIPS2021][Pytorch]
  • [DFR-Net] The Devil Is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection [ICCV2021]
  • [AutoShape] AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection [ICCV2021][Pytorch][Paddle]
  • [pseudo-analysis] Are we Missing Confidence in Pseudo-LiDAR Methods for Monocular 3D Object Detection? [ICCV2021]
  • [Gated3D] Gated3D: Monocular 3D Object Detection From Temporal Illumination Cues [ICCV2021]
  • [MonoRCNN] Geometry-based Distance Decomposition for Monocular 3D Object Detection [ICCV2021][Pytorch]
  • [DD3D] Is Pseudo-Lidar needed for Monocular 3D Object detection [ICCV2021][Pytorch]
  • [GUPNet] Geometry Uncertainty Projection Network for Monocular 3D Object Detection [ICCV2021][Pytorch]
  • [Neighbor-Vote] Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting [ACMMM2021]
  • [MonoEF] Monocular 3D Object Detection: An Extrinsic Parameter Free Approach [CVPR2021][Pytorch]
  • [monodle] Delving into Localization Errors for Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [Monoflex] Objects are Different: Flexible Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [GrooMeD-NMS] GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [DDMP-3D] Depth-conditioned Dynamic Message Propagation for Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [MonoRUn] MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation [CVPR2021][Pytorch]
  • [M3DSSD] M3DSSD: Monocular 3D Single Stage Object Detector [CVPR2021][Pytorch]
  • [CaDDN] Categorical Depth Distribution Network for Monocular 3D Object Detection [CVPR2021][Pytorch]
  • [visualDet3D] Ground-aware Monocular 3D Object Detection for Autonomous Driving [RA-L][Pytorch]

2020

  • [UR3D] Distance-Normalized Unified Representation for Monocular 3D Object Detection [ECCV2020]
  • [MonoDR] Monocular Differentiable Rendering for Self-Supervised 3D Object Detection [ECCV2020]
  • [DA-3Ddet] Monocular 3d object detection via feature domain adaptation [ECCV2020]
  • [MoVi-3D] Towards generalization across depth for monocular 3d object detection [ECCV2020]
  • [PatchNet] Rethinking Pseudo-LiDAR Representation [ECCV2020][Pytorch]
  • [RAR-Net] Reinforced Axial Refinement Network for Monocular 3D Object Detection [ECCV2020]
  • [kinematic3d] Kinematic 3D Object Detection in Monocular Video [ECCV2020][Pytorch]
  • [RTM3D] RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving [ECCV2020][Pytorch]
  • [SMOKE] SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation [CVPRW2020][Pytorch]
  • [D4LCN] Learning Depth-Guided Convolutions for Monocular 3D Object Detection [CVPRW2020][Pytorch]
  • [MonoPair] MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships [CVPR2020]
  • [pseudo-LiDAR_e2e] End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection [CVPR2020][Pytorch]
  • [Pseudo-LiDAR++] Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving [ICLR2020][Pytorch]
  • [OACV] Object-Aware Centroid Voting for Monocular 3D Object Detection [IROS2020]
  • [MonoGRNet_v2] Monocular 3D Object Detection via Geometric Reasoning on Keypoints [VISIGRAPP2020]
  • [ForeSeE] Task-Aware Monocular Depth Estimation for 3D Object Detection [AAAI2020(oral)][Pytorch]
  • [Decoupled-3D] Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation [AAAI2020]

2019

  • [3d-vehicle-tracking] Joint Monocular 3D Vehicle Detection and Tracking [ICCV2019][Pytorch]
  • [MonoDIS] Disentangling monocular 3d object detection [ICCV2019]
  • [AM3D] Accurate Monocular Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving [ICCV2019]
  • [M3D-RPN] M3D-RPN: Monocular 3D Region Proposal Network for Object Detection [ICCV2019(Oral)][Pytorch]
  • [MVRA] Multi-View Reprojection Architecture for Orientation Estimation [ICCVW2019]
  • [Mono3DPLiDAR] Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud [ICCVW2019]
  • [MonoPSR] Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction [CVPR2019][Pytorch]
  • [FQNet] Deep fitting degree scoring network for monocular 3d object detection [CVPR2019]
  • [ROI-10D] ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape [CVPR2019]
  • [GS3D] GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving [CVPR2019]
  • [Pseudo-LiDAR] Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving [CVPR2019][Pytorch]
  • [BirdGAN] Learning 2D to 3D Lifting for Object Detection in 3D for Autonomous Vehicles [IROS2019]
  • [MonoGRNet] MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization [AAAI2019(oral)][Tensorflow]
  • [OFT-Net] Orthographic feature transform for monocular 3d object detection [BMVC2019][Pytorch]
  • [Shift R-CNN] Shift R-CNN: Deep Monocular 3D Object Detection with Closed-Form Geometric Constraints [TIP2019]
  • [SS3D] SS3D: Monocular 3d object detection and box fitting trained end-to-end using intersection-over-union loss [Arxiv2019]

2018

  • [Multi-Fusion] Multi-Level Fusion based 3D Object Detection from Monocular Images [CVPR2018][Pytorch]
  • [Mono3D++] Mono3D++: Monocular 3D Vehicle Detection with Two-Scale 3D Hypotheses and Task Priors [AAAI2018]

2017

  • [Deep3DBox] 3D Bounding Box Estimation Using Deep Learning and Geometry [CVPR2017][Pytorch][Tensorflow]
  • [Deep MANTA] Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis from monocular image [CVPR2017]

2016

  • [Mono3D] Monocular 3D object detection for autonomous driving [CVPR2016]

KITTI Results

Method Extra Test, AP3D|R40 Val, AP3D|R40 Val, AP3D|R11 Reference
Easy Mod. Hard Easy Mod. Hard Easy Mod. Hard
MonoRUn Lidar 19.65 12.30 10.58 20.02 14.65 12.61 - - - CVPR2021
CaDDN Lidar 19.17 13.41 11.46 23.57 16.31 13.84 - - - CVPR2021
AM3D Depth 16.50 10.74 9.52 28.31 15.76 12.24 32.23 21.09 17.26 ICCV2019
PatchNet Depth 15.68 11.12 10.17 31.60 16.80 13.80 35.10 22.00 19.60 ECCV2020
D4LCN Depth 16.65 11.72 9.51 22.32 16.20 12.30 26.97 21.72 18.22 CVPRW2020
DFR-Net Depth 19.40 13.63 10.35 24.81 17.78 14.41 28.80 22.88 19.47 ICCV2021
M3D-RPN None 14.76 9.71 7.42 14.53 11.07 8.65 20.27 17.06 15.21 ICCV2019
SMOKE None 14.03 9.76 7.84 - - - 14.76 12.85 11.50 CVPRW2020
MonoPair None 13.04 9.99 8.65 16.28 12.30 10.42 - - - CVPR2020
RTM3D None 14.41 10.34 8.77 - - - 20.77 16.86 16.63 ECCV2020
M3DSSD None 17.51 11.46 8.98 - - - 27.77 21.67 18.28 CVPR2021
Monoflex None 19.94 13.89 12.07 23.64 17.51 14.83 28.17 21.92 19.07 CVPR2021
GUPNet None 20.11 14.20 11.77 22.76 16.46 13.72 - - - ICCV2021
MonoCon None 22.50 16.46 13.95 26.33 19.01 15.98 - - - AAAI2022
Owner
Zhikang Zou
Baidu Inc.
Zhikang Zou
A curated list of awesome Active Learning

Awesome Active Learning 🤩 A curated list of awesome Active Learning ! 🤩 Background (image source: Settles, Burr) What is Active Learning? Active lea

BAI Fan 431 Jan 03, 2023
This is an implementation for the CVPR2020 paper "Learning Invariant Representation for Unsupervised Image Restoration"

Learning Invariant Representation for Unsupervised Image Restoration (CVPR 2020) Introduction This is an implementation for the paper "Learning Invari

GarField 88 Nov 07, 2022
Sleep staging from ECG, assisted with EEG

Sleep_Staging_Knowledge Distillation This codebase implements knowledge distillation approach for ECG based sleep staging assisted by EEG based sleep

2 Dec 12, 2022
Training and Evaluation Code for Neural Volumes

Neural Volumes This repository contains training and evaluation code for the paper Neural Volumes. The method learns a 3D volumetric representation of

Meta Research 370 Dec 08, 2022
FFCV: Fast Forward Computer Vision (and other ML workloads!)

Fast Forward Computer Vision: train models at a fraction of the cost with accele

FFCV 2.3k Jan 03, 2023
Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track (SIGIR 2021 Full Paper).

Optimizing Dense Retrieval Model Training with Hard Negatives Jingtao Zhan, Jiaxin Mao, Yiqun Liu, Jiafeng Guo, Min Zhang, Shaoping Ma This repo provi

Jingtao Zhan 99 Dec 27, 2022
Mixed Transformer UNet for Medical Image Segmentation

MT-UNet Update 2021/11/19 Thank you for your interest in our work. We have uploaded the code of our MTUNet to help peers conduct further research on i

dotman 92 Dec 25, 2022
AQP is a modular pipeline built to enable the comparison and testing of different quality metric configurations.

Audio Quality Platform - AQP An Open Modular Python Platform for Objective Speech and Audio Quality Metrics AQP is a highly modular pipeline designed

Jack Geraghty 24 Oct 01, 2022
Code for IntraQ, PyTorch implementation of our paper under review

IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network Quantization paper Requirements Python = 3.7.10 Pytorch == 1.7

1 Nov 19, 2021
Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP

mmc installation git clone https://github.com/dmarx/Multi-Modal-Comparators cd 'Multi-Modal-Comparators' pip install poetry poetry build pip install d

David Marx 37 Nov 25, 2022
LaneDetectionAndLaneKeeping - Lane Detection And Lane Keeping

LaneDetectionAndLaneKeeping This project is part of my bachelor's thesis. The go

5 Jun 27, 2022
Differentiable simulation for system identification and visuomotor control

gradsim gradSim: Differentiable simulation for system identification and visuomotor control gradSim is a unified differentiable rendering and multiphy

105 Dec 18, 2022
A decent AI that solves daily Wordle puzzles. Works with different websites with similar wordlists,.

Wordle-AI A decent AI that solves daily "Wordle" puzzles. Works with different websites with similar wordlists. When prompted with "Word:" enter the w

Ethan 1 Feb 10, 2022
WTTE-RNN a framework for churn and time to event prediction

WTTE-RNN Weibull Time To Event Recurrent Neural Network A less hacky machine-learning framework for churn- and time to event prediction. Forecasting p

Egil Martinsson 727 Dec 28, 2022
Pyramid Pooling Transformer for Scene Understanding

Pyramid Pooling Transformer for Scene Understanding Requirements: torch 1.6+ torchvision 0.7.0 timm==0.3.2 Validated on torch 1.6.0, torchvision 0.7.0

Yu-Huan Wu 119 Dec 29, 2022
Real life contra a deep learning project built using mediapipe and openc

real-life-contra Description A python script that translates the body movement into in game control. Welcome to all new real life contra a deep learni

Programminghut 7 Jan 26, 2022
ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.

This repo contains some of the codes for the following paper Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code

Xuewen Yang 56 Dec 08, 2022
List of papers, code and experiments using deep learning for time series forecasting

Deep Learning Time Series Forecasting List of state of the art papers focus on deep learning and resources, code and experiments using deep learning f

Alexander Robles 2k Jan 06, 2023
dataset for ECCV 2020 "Motion Capture from Internet Videos"

Motion Capture from Internet Videos Motion Capture from Internet Videos Junting Dong*, Qing Shuai*, Yuanqing Zhang, Xian Liu, Xiaowei Zhou, Hujun Bao

ZJU3DV 98 Dec 07, 2022
BridgeGAN - Tensorflow implementation of Bridging the Gap between Label- and Reference-based Synthesis in Multi-attribute Image-to-Image Translation.

Bridging the Gap between Label- and Reference based Synthesis(ICCV 2021) Tensorflow implementation of Bridging the Gap between Label- and Reference-ba

huangqiusheng 8 Jul 13, 2022