An official implementation of the Anchor DETR.

Last update: Dec 28, 2022

Related tags

Overview

Anchor DETR: Query Design for Transformer-Based Detector

Introduction

This repository is an official implementation of the Anchor DETR. We encode the anchor points as the object queries in DETR. Multiple patterns are attached to each anchor point to solve the difficulty: "one region, multiple objects". We also propose an attention variant RCDA to reduce the memory cost for high-resolution features.

Main Results

	feature	epochs	AP	GFLOPs	Infer Speed (FPS)
DETR	DC5	500	43.3	187	10 (12)
SMCA	multi-level	50	43.7	152	10
Deformable DETR	multi-level	50	43.8	173	15
Conditional DETR	DC5	50	43.8	195	10
Anchor DETR	DC5	50	44.3	151	16 (19)

Note:

The results are based on ResNet-50 backbone.
Inference speeds are measured on NVIDIA Tesla V100 GPU.
The values in parentheses of the Infer Speed indicate the speed with torchscript optimization.

Model

name	backbone	AP	URL
AnchorDETR-C5	R50	42.1	model / log
AnchorDETR-DC5	R50	44.3	model / log
AnchorDETR-C5	R101	43.5	model / log
AnchorDETR-DC5	R101	45.1	model / log

Note: the models and logs are also available at Baidu Netdisk with code hh13.

Usage

Installation

First, clone the repository locally:

git clone https://github.com/megvii-research/AnchorDETR.git

Then, install dependencies:

pip install -r requirements.txt

Training

To train AnchorDETR on a single node with 8 GPUs:

python -m torch.distributed.launch --nproc_per_node=8 --use_env main.py  --coco_path /path/to/coco

Evaluation

To evaluate AnchorDETR on a single node with 8 GPUs:

python -m torch.distributed.launch --nproc_per_node=8 --use_env main.py --eval --coco_path /path/to/coco --resume /path/to/checkpoint.pth

To evaluate AnchorDETR with a single GPU:

python main.py --eval --coco_path /path/to/coco --resume /path/to/checkpoint.pth

Citation

If you find this project useful for your research, please consider citing the paper.

@misc{wang2021anchor,
      title={Anchor DETR: Query Design for Transformer-Based Detector},
      author={Yingming Wang and Xiangyu Zhang and Tong Yang and Jian Sun},
      year={2021},
      eprint={2109.07107},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Contact

If you have any questions, feel free to open an issue or contact us at [email protected].

An official implementation of the Anchor DETR.

Related tags

Overview

Anchor DETR: Query Design for Transformer-Based Detector

Introduction

Main Results

Model

Usage

Installation

Training

Evaluation

Citation

Contact

Owner

MEGVII Research

Code for "Learning From Multiple Experts: Self-paced Knowledge Distillation for Long-tailed Classification", ECCV 2020 Spotlight

AQP is a modular pipeline built to enable the comparison and testing of different quality metric configurations.

Official PyTorch Implementation of Convolutional Hough Matching Networks, CVPR 2021 (oral)

The-Secret-Sharing-Schemes - This interactive script demonstrates the Secret Sharing Schemes algorithm

FcaNet: Frequency Channel Attention Networks

Enabling dynamic analysis of Legacy Embedded Systems in full emulated environment

NLP made easy

Cognition-aware Cognate Detection

YOLOX-RMPOLY

Vehicle direction identification consists of three module detection , tracking and direction recognization.

Data from "HateCheck: Functional Tests for Hate Speech Detection Models" (Röttger et al., ACL 2021)

Bi-level feature alignment for versatile image translation and manipulation (Under submission of TPAMI)

Stroke-predictions-ml-model - Machine learning model to predict individuals chances of having a stroke

Simple STAC Catalogs discovery tool.

A PyTorch implementation of "Semi-Supervised Graph Classification: A Hierarchical Graph Perspective" (WWW 2019)

Cleaned test data list of DukeMTMC-reID, ICCV2021

A small tool to joint picture including gif

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]

Hierarchical Few-Shot Generative Models

Global Rhythm Style Transfer Without Text Transcriptions