Monocular 3D Object Detection: An Extrinsic Parameter Free Approach (CVPR2021)

Yunsong Zhou, Yuan He, Hongzi Zhu, Cheng Wang, Hongyang Li, Qinhong Jiang

Our paper is now avaiable on CVPR 2021 open access.

Introduction

Our framework is implemented and tested with Ubuntu 16.04, CUDA 8.0/9.0, Python 3, Pytorch 0.4/1.0/1.1, NVIDIA Tesla V100/TITANX GPU.

If you find our work useful in your research please consider citing our paper:

@InProceedings{Zhou_2021_CVPR,
author    = {Zhou, Yunsong and He, Yuan and Zhu, Hongzi and Wang, Cheng and Li, Hongyang and Jiang, Qinhong},
title     = {Monocular 3D Object Detection: An Extrinsic Parameter Free Approach},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month     = {June},
year      = {2021},
pages     = {7556-7566}
}

Requirements

Cuda & Cudnn & Python & Pytorch

This project is tested with CUDA 8.0/9.0, Python 3, Pytorch 0.4/1.0/1.1, NVIDIA Tesla V100/TITANX GPU. And almost all the packages we use are covered by Anaconda.

Please install proper CUDA and CUDNN version, and then install Anaconda3 and Pytorch.

Data preparation

Download and unzip the full KITTI detection dataset.

Training

I am currently busy with my own courses. I will sort out the work involved in the near future. Relevant code and models will be avaiable soon.

Monocular 3D Object Detection: An Extrinsic Parameter Free Approach (CVPR2021)

Related tags

Overview

Monocular 3D Object Detection: An Extrinsic Parameter Free Approach (CVPR2021)

Introduction

Requirements

Data preparation

Training

Owner

Yunsong Zhou

Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations

DirectVoxGO reconstructs a scene representation from a set of calibrated images capturing the scene.

LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference

"SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image", Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Humphrey Shi, Zhangyang Wang

Image Segmentation Animation using Quadtree concepts.

The code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Code artifacts for the submission "Mind the Gap! A Study on the Transferability of Virtual vs Physical-world Testing of Autonomous Driving Systems"

Unifying Global-Local Representations in Salient Object Detection with Transformer

TabNet for fastai

TorchDistiller - a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

Official implementation of the paper DeFlow: Learning Complex Image Degradations from Unpaired Data with Conditional Flows

Multi-Task Learning as a Bargaining Game

Pytorch implementation of "Neural Wireframe Renderer: Learning Wireframe to Image Translations"

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.

BasicNeuralNetwork - This project looks over the basic structure of a neural network and how machine learning training algorithms work

Pytorch implementation of face attention network

Code for: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

What can linearized neural networks actually say about generalization?

PINN(s): Physics-Informed Neural Network(s) for von Karman vortex street