Learning Calibrated-Guidance for Object Detection in Aerial Images

Last update: Sep 22, 2022

Related tags

Overview

Learning Calibrated-Guidance for Object Detection in Aerial Images arxiv

We propose a simple yet effective Calibrated-Guidance (CG) scheme to enhance channel communications in a feature transformer fashion, which can adaptively determine the calibration weights for each channel based on the global feature affinity-pairs. Specifically, given a set of feature maps, CG first computes the feature similarity between each channel and the remaining channels as the intermediary calibration guidance. Then, re-representing each channel by aggregating all the channels weighted together via the guidance. Our CG can be plugged into any deep neural network, which is named as CG-Net. To demonstrate its effectiveness and efficiency, extensive experiments are carried out on both oriented and horizontal object detection tasks of aerial images. Results on two challenging benchmarks (i.e., DOTA and HRSC2016) demonstrate that our CG-Net can achieve state-of-the-art performance in accuracy with a fair computational overhead.

Introduction

This codebase is created to build benchmarks for object detection in aerial images. It is modified from mmdetection. The master branch works with PyTorch 1.1 or higher. If you would like to use PyTorch 0.4.1, please checkout to the pytorch-0.4.1 branch.

Results

Visualization results for oriented object detection on the test set of DOTA.

Comparison to the baseline on DOTA for oriented object detection with ResNet-101. The figures with blue boxes are the results of the baseline and pink boxes are the results of our proposed CG-Net.

Experiment

ImageNet Pretrained Model from Pytorch

The effectiveness of our proposed methods with different backbone network on the test of DOTA.

Backbone	+CG	Weight	mAP(%)
ResNet-50		download	73.26
ResNet-50	+	download	74.21
ResNet-101		download	73.06
ResNet-101	+	download	74.30
ResNet-152		download	72.78
ResNet-152	+	download	73.53

CG-Net Results in DOTA.

Backbone	Aug Rotate	Task	Weight	mAP(%)
ResNet-101	+	Oriented	download	77.89
ResNet-101	+	Horizontal	download	78.26

Installation

Please refer to INSTALL.md for installation.

Get Started

Please see GETTING_STARTED.md for the basic usage of mmdetection.

Contributing

We appreciate all contributions to improve benchmarks for object detection in aerial images.

Citing

If you use our work, please consider citing:

@InProceedings{liang2021learning,
      title={Learning Calibrated-Guidance for Object Detection in Aerial Images}, 
      author={Dong, Liang and Zongqi, Wei and Dong, Zhang and Qixiang, Geng and Liyan, Zhang and Han, Sun and Huiyu, Zhou and Mingqiang, Wei and Pan, Gao},
      booktitle ={arXiv:2103.11399},
      year={2021}
}

Thanks to the Third Party Libs

Pytorch

mmdetection

AerialDetection

Learning Calibrated-Guidance for Object Detection in Aerial Images

Related tags

Overview

Learning Calibrated-Guidance for Object Detection in Aerial Images arxiv

Introduction

Results

Experiment

Installation

Get Started

Contributing

Citing

Thanks to the Third Party Libs

Owner

We utilize deep reinforcement learning to obtain favorable trajectories for visual-inertial system calibration.

[Preprint] "Bag of Tricks for Training Deeper Graph Neural Networks A Comprehensive Benchmark Study" by Tianlong Chen, Kaixiong Zhou, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang

RoMA: Robust Model Adaptation for Offline Model-based Optimization

Code for sound field predictions in domains with impedance boundaries. Used for generating results from the paper

Single object tracking and segmentation.

Pre-trained NFNets with 99% of the accuracy of the official paper

Very deep VAEs in JAX/Flax

Simple STAC Catalogs discovery tool.

Code for HLA-Face: Joint High-Low Adaptation for Low Light Face Detection (CVPR21)

TCube generates rich and fluent narratives that describes the characteristics, trends, and anomalies of any time-series data (domain-agnostic) using the transfer learning capabilities of PLMs.

Baseline powergrid model for NY

GAN-generated image detection based on CNNs

Scaling Vision with Sparse Mixture of Experts

A collection of loss functions for medical image segmentation

Benchmark for the generalization of 3D machine learning models across different remeshing/samplings of a surface.

Artstation-Artistic-face-HQ Dataset (AAHQ)

Constructing interpretable quadratic accuracy predictors to serve as an objective function for an IQCQP problem that represents NAS under latency constraints and solve it with efficient algorithms.

The official PyTorch implementation for the paper "sMGC: A Complex-Valued Graph Convolutional Network via Magnetic Laplacian for Directed Graphs".

DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models

SwinIR: Image Restoration Using Swin Transformer

Learning Calibrated-Guidance for Object Detection in Aerial Images

Related tags

Overview

Learning Calibrated-Guidance for Object Detection in Aerial Images arxiv

Introduction

Results

Experiment

Installation

Get Started

Contributing

Citing

Thanks to the Third Party Libs

Owner

We utilize deep reinforcement learning to obtain favorable trajectories for visual-inertial system calibration.

[Preprint] "Bag of Tricks for Training Deeper Graph Neural Networks A Comprehensive Benchmark Study" by Tianlong Chen*, Kaixiong Zhou*, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang

RoMA: Robust Model Adaptation for Offline Model-based Optimization

Code for sound field predictions in domains with impedance boundaries. Used for generating results from the paper

Single object tracking and segmentation.

Pre-trained NFNets with 99% of the accuracy of the official paper

Very deep VAEs in JAX/Flax

Simple STAC Catalogs discovery tool.

Code for HLA-Face: Joint High-Low Adaptation for Low Light Face Detection (CVPR21)

TCube generates rich and fluent narratives that describes the characteristics, trends, and anomalies of any time-series data (domain-agnostic) using the transfer learning capabilities of PLMs.

Baseline powergrid model for NY

GAN-generated image detection based on CNNs

Scaling Vision with Sparse Mixture of Experts

A collection of loss functions for medical image segmentation

Benchmark for the generalization of 3D machine learning models across different remeshing/samplings of a surface.

Artstation-Artistic-face-HQ Dataset (AAHQ)

Constructing interpretable quadratic accuracy predictors to serve as an objective function for an IQCQP problem that represents NAS under latency constraints and solve it with efficient algorithms.

The official PyTorch implementation for the paper "sMGC: A Complex-Valued Graph Convolutional Network via Magnetic Laplacian for Directed Graphs".

DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models

SwinIR: Image Restoration Using Swin Transformer

[Preprint] "Bag of Tricks for Training Deeper Graph Neural Networks A Comprehensive Benchmark Study" by Tianlong Chen, Kaixiong Zhou, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang