Learning Calibrated-Guidance for Object Detection in Aerial Images

Last update: Sep 22, 2022

Related tags

Overview

Learning Calibrated-Guidance for Object Detection in Aerial Images arxiv

We propose a simple yet effective Calibrated-Guidance (CG) scheme to enhance channel communications in a feature transformer fashion, which can adaptively determine the calibration weights for each channel based on the global feature affinity-pairs. Specifically, given a set of feature maps, CG first computes the feature similarity between each channel and the remaining channels as the intermediary calibration guidance. Then, re-representing each channel by aggregating all the channels weighted together via the guidance. Our CG can be plugged into any deep neural network, which is named as CG-Net. To demonstrate its effectiveness and efficiency, extensive experiments are carried out on both oriented and horizontal object detection tasks of aerial images. Results on two challenging benchmarks (i.e., DOTA and HRSC2016) demonstrate that our CG-Net can achieve state-of-the-art performance in accuracy with a fair computational overhead.

Introduction

This codebase is created to build benchmarks for object detection in aerial images. It is modified from mmdetection. The master branch works with PyTorch 1.1 or higher. If you would like to use PyTorch 0.4.1, please checkout to the pytorch-0.4.1 branch.

Results

Visualization results for oriented object detection on the test set of DOTA.

Comparison to the baseline on DOTA for oriented object detection with ResNet-101. The figures with blue boxes are the results of the baseline and pink boxes are the results of our proposed CG-Net.

Experiment

ImageNet Pretrained Model from Pytorch

The effectiveness of our proposed methods with different backbone network on the test of DOTA.

Backbone	+CG	Weight	mAP(%)
ResNet-50		download	73.26
ResNet-50	+	download	74.21
ResNet-101		download	73.06
ResNet-101	+	download	74.30
ResNet-152		download	72.78
ResNet-152	+	download	73.53

CG-Net Results in DOTA.

Backbone	Aug Rotate	Task	Weight	mAP(%)
ResNet-101	+	Oriented	download	77.89
ResNet-101	+	Horizontal	download	78.26

Installation

Please refer to INSTALL.md for installation.

Get Started

Please see GETTING_STARTED.md for the basic usage of mmdetection.

Contributing

We appreciate all contributions to improve benchmarks for object detection in aerial images.

Citing

If you use our work, please consider citing:

@InProceedings{liang2021learning,
      title={Learning Calibrated-Guidance for Object Detection in Aerial Images}, 
      author={Dong, Liang and Zongqi, Wei and Dong, Zhang and Qixiang, Geng and Liyan, Zhang and Han, Sun and Huiyu, Zhou and Mingqiang, Wei and Pan, Gao},
      booktitle ={arXiv:2103.11399},
      year={2021}
}

Thanks to the Third Party Libs

Pytorch

mmdetection

AerialDetection

Learning Calibrated-Guidance for Object Detection in Aerial Images

Related tags

Overview

Learning Calibrated-Guidance for Object Detection in Aerial Images arxiv

Introduction

Results

Experiment

Installation

Get Started

Contributing

Citing

Thanks to the Third Party Libs

Owner

Data-driven reduced order modeling for nonlinear dynamical systems

Official implementation for Scale-Aware Neural Architecture Search for Multivariate Time Series Forecasting

A High-Level Fusion Scheme for Circular Quantities published at the 20th International Conference on Advanced Robotics

LAVT: Language-Aware Vision Transformer for Referring Image Segmentation

Dynamic View Synthesis from Dynamic Monocular Video

[NeurIPS2021] Code Release of Learning Transferable Perturbations

Official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence".

PyTorch deep learning projects made easy.

Continuous Query Decomposition for Complex Query Answering in Incomplete Knowledge Graphs

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

Developing your First ML Workflow of the AWS Machine Learning Engineer Nanodegree Program

Trading environnement for RL agents, backtesting and training.

This repository contains code to run experiments in the paper "Signal Strength and Noise Drive Feature Preference in CNN Image Classifiers."

Check out the StyleGAN repo and place it in the same directory hierarchy as the present repo

Consumer Fairness in Recommender Systems: Contextualizing Definitions and Mitigations

Code for the ICML 2021 paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

A Learning-based Camera Calibration Toolbox

Synthesize photos from PhotoDNA using machine learning 🌱

A Pytorch Implementation of ClariNet

Code reproduce for paper "Vehicle Re-identification with Viewpoint-aware Metric Learning"