Object DGCNN & DETR3D

This repo contains the implementations of Object DGCNN (https://arxiv.org/abs/2110.06923) and DETR3D (https://arxiv.org/abs/2110.06922). Both implementations are built on top of MMDetection3D.

Prerequisites

mmcv (https://github.com/open-mmlab/mmcv)
mmdet (https://github.com/open-mmlab/mmdetection)
mmseg (https://github.com/open-mmlab/mmsegmentation)
mmdet3d (https://github.com/open-mmlab/mmdetection3d)
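
Before training, it can help to confirm that the four packages above are importable in the current environment. The following is a minimal sketch and not part of this repo: the helper name `missing_packages` is hypothetical, and it only checks that each package can be found, not that the installed versions are compatible with one another.

```python
import importlib.util

# Import names of the prerequisite packages listed above.
REQUIRED = ["mmcv", "mmdet", "mmseg", "mmdet3d"]


def missing_packages(names=REQUIRED):
    """Return the subset of `names` that cannot be located for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All prerequisite packages were found.")
```

An empty result means every package was located; it does not guarantee that the versions match what MMDetection3D expects.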

Data

Follow the mmdet3d data preparation instructions to process the data.

Train

Download the pretrained backbone weights to pretrained/.
For example, to train Object DGCNN with the pillar backbone on 8 GPUs, run:

tools/dist_train.sh projects/configs/obj_dgcnn/pillar.py 8

Evaluation using pretrained models

Download the weights for the model you want to evaluate from the tables below.

Backbone	mAP	NDS	Download
DETR3D, ResNet101 w/ DCN	34.7	42.2	model | log
DETR3D, ResNet101 w/ DCN, + CBGS	34.9	43.4	model | log
DETR3D, VoVNet, trained on trainval, evaluated on the test set	41.2	47.9	model | log

Backbone	mAP	NDS	Download
Object DGCNN, pillar	53.2	62.8	model | log
Object DGCNN, voxel	58.6	66.0	model | log

To test on 8 GPUs, run:
tools/dist_test.sh projects/configs/obj_dgcnn/pillar_cosine.py /path/to/ckpt 8 --eval=bbox
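
The test command takes the config, the checkpoint path, and the number of GPUs, in that order, followed by the evaluation flag. As a rough sketch (the helpers `build_test_command` and `run_test` are hypothetical and not part of this repo), the same command can be assembled in Python so that a missing checkpoint is caught before the job is launched:

```python
import os
import subprocess


def build_test_command(config, checkpoint, gpus=8):
    """Assemble the argument list for the test command shown above."""
    return ["tools/dist_test.sh", config, checkpoint, str(gpus), "--eval=bbox"]


def run_test(config, checkpoint, gpus=8):
    """Launch evaluation, failing early if the checkpoint file is missing."""
    if not os.path.isfile(checkpoint):
        raise FileNotFoundError(f"checkpoint not found: {checkpoint}")
    # Assumption: the current working directory is the repo root,
    # so that tools/dist_test.sh resolves correctly.
    return subprocess.run(build_test_command(config, checkpoint, gpus), check=True)
```

Calling `run_test("projects/configs/obj_dgcnn/pillar_cosine.py", "/path/to/ckpt")` from the repo root would then be equivalent to the shell command above.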

If you find this repo useful for your research, please consider citing the papers:

@inproceedings{
   obj-dgcnn,
   title={Object DGCNN: 3D Object Detection using Dynamic Graphs},
   author={Wang, Yue and Solomon, Justin M.},
   booktitle={2021 Conference on Neural Information Processing Systems ({NeurIPS})},
   year={2021}
}

@inproceedings{
   detr3d,
   title={DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries},
   author={Wang, Yue and Guizilini, Vitor and Zhang, Tianyuan and Wang, Yilun and Zhao, Hang and Solomon, Justin M.},
   booktitle={The Conference on Robot Learning ({CoRL})},
   year={2021}
}
