Benchmarks for Object Detection in Aerial Images

Last update: Dec 30, 2022

Introduction

This codebase builds benchmarks for object detection in aerial images. It is modified from mmdetection. The master branch works with PyTorch 1.1 or higher. To use PyTorch 0.4.1, check out the pytorch-0.4.1 branch.

Main Features

To adapt to object detection in aerial images, this repo adds several features that the original mmdetection does not have:

Support Oriented Object Detection

In aerial images, objects are usually annotated with oriented bounding boxes (OBBs). To support oriented object detection, we implement OBB heads (OBBRoIHead and OBBDenseHead). We also provide functions to convert mask predictions to OBBs.
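As a rough illustration of what an oriented box encodes, the sketch below converts a box given as center, size, and angle into its four corner points. The `(cx, cy, w, h, theta)` parameterization and the name `obb_to_corners` are assumptions made for this example; they are not taken from this repo's API, and the repo's own angle convention may differ.

```python
import math


def obb_to_corners(cx, cy, w, h, theta):
    """Return the four corners of an oriented box as (x, y) tuples.

    Assumed convention (hypothetical, for illustration only):
    theta is in radians and rotates the box's width axis away
    from the x-axis around the center (cx, cy).
    """
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    # Half-extent vectors along the box's width and height axes.
    wx, wy = 0.5 * w * cos_t, 0.5 * w * sin_t
    hx, hy = -0.5 * h * sin_t, 0.5 * h * cos_t
    return [
        (cx - wx - hx, cy - wy - hy),
        (cx + wx - hx, cy + wy - hy),
        (cx + wx + hx, cy + wy + hy),
        (cx - wx + hx, cy - wy + hy),
    ]


if __name__ == "__main__":
    # An axis-aligned 4 x 2 box centered at the origin.
    print(obb_to_corners(0.0, 0.0, 4.0, 2.0, 0.0))
```

With `theta = 0` the result is an ordinary horizontal box, so a horizontal bounding box can be treated as a special case of this representation.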

Cython Bbox Overlaps

In DOTA, a single 1024 × 1024 image patch may contain over 1000 instances, which makes computing bbox overlaps memory-consuming. To avoid running out of GPU memory, we calculate the bbox overlaps in Cython. The speed of the Cython version is close to that of the GPU version.
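To make the computation concrete, here is a minimal pure-Python sketch of pairwise overlaps (IoU) between two sets of horizontal boxes, written as an explicit double loop in the style a Cython routine might use. It is not the repo's Cython code: the function name `bbox_overlaps`, the `(x1, y1, x2, y2)` box format, and the continuous-coordinate area convention are assumptions for this example.

```python
def bbox_overlaps(boxes, query_boxes):
    """Pairwise IoU between two lists of (x1, y1, x2, y2) boxes.

    Returns a nested list with len(boxes) rows and
    len(query_boxes) columns.
    """
    overlaps = [[0.0] * len(query_boxes) for _ in boxes]
    for k, (qx1, qy1, qx2, qy2) in enumerate(query_boxes):
        q_area = (qx2 - qx1) * (qy2 - qy1)
        for n, (bx1, by1, bx2, by2) in enumerate(boxes):
            # Width of the intersection; non-positive means no overlap.
            iw = min(bx2, qx2) - max(bx1, qx1)
            if iw <= 0:
                continue
            # Height of the intersection.
            ih = min(by2, qy2) - max(by1, qy1)
            if ih <= 0:
                continue
            inter = iw * ih
            union = (bx2 - bx1) * (by2 - by1) + q_area - inter
            overlaps[n][k] = inter / union
    return overlaps


if __name__ == "__main__":
    print(bbox_overlaps([(0, 0, 2, 2)], [(1, 1, 3, 3), (0, 0, 2, 2)]))
```

The loop visits one box pair at a time, so the only large allocation is the output matrix itself, which has one entry per pair of boxes.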

Rotation Augmentation

Since objects in aerial images appear in many orientations, we implement online rotation augmentation.
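The annotation side of such an augmentation can be sketched as follows: sample an angle per training sample and rotate every polygon around the image center. The names `rotate_polygon` and `random_rotate`, the angle range, and the choice of the image center as pivot are assumptions for this example, not the repo's implementation. The image itself would have to be rotated with the same angle and center, which is not shown here.

```python
import math
import random


def rotate_polygon(points, angle_deg, center):
    """Rotate a list of (x, y) points by angle_deg around center."""
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    cx, cy = center
    rotated = []
    for x, y in points:
        dx, dy = x - cx, y - cy
        rotated.append((cx + dx * cos_t - dy * sin_t,
                        cy + dx * sin_t + dy * cos_t))
    return rotated


def random_rotate(polygons, image_size, rng, max_angle=180.0):
    """Rotate all polygons of one image by a single random angle.

    image_size is (width, height); rng is a random.Random instance.
    Returns the sampled angle and the rotated polygons.
    """
    width, height = image_size
    angle = rng.uniform(-max_angle, max_angle)
    center = (width / 2.0, height / 2.0)
    return angle, [rotate_polygon(p, angle, center) for p in polygons]


if __name__ == "__main__":
    rng = random.Random(0)
    box = [(100, 100), (200, 100), (200, 150), (100, 150)]
    print(random_rotate([box], (1024, 1024), rng))
```

In image coordinates, where the y-axis points down, a positive angle in this sketch appears as a clockwise rotation on screen.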

Rotated RoI Warping

Currently, we implement two types of rotated RoI warping: Rotated RoI Align and Rotated Position Sensitive RoI Align.
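The idea behind Rotated RoI Align can be illustrated with a simplified single-channel sketch: lay a regular grid over the rotated RoI, map each bin center into feature-map coordinates, and read the value there by bilinear interpolation. Everything below is an assumption made for illustration (the `(cx, cy, w, h, theta)` RoI format, one sample per bin, zero outside the map, and the function names); the repo's operators may differ, and the position-sensitive variant is not covered.

```python
import math


def bilinear(feat, x, y):
    """Bilinearly interpolate feat[row][col] at (x, y); 0 outside."""
    h, w = len(feat), len(feat[0])
    if x < 0 or y < 0 or x > w - 1 or y > h - 1:
        return 0.0
    x0, y0 = int(math.floor(x)), int(math.floor(y))
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    ax, ay = x - x0, y - y0
    top = feat[y0][x0] * (1 - ax) + feat[y0][x1] * ax
    bottom = feat[y1][x0] * (1 - ax) + feat[y1][x1] * ax
    return top * (1 - ay) + bottom * ay


def rotated_roi_align(feat, roi, out_h, out_w):
    """Pool a rotated RoI into an out_h x out_w grid (one sample per bin).

    roi is (cx, cy, w, h, theta) in feature-map coordinates,
    with theta in radians.
    """
    cx, cy, w, h, theta = roi
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    out = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            # Bin center in the RoI's local frame (origin at RoI center).
            lx = (j + 0.5) * w / out_w - w / 2.0
            ly = (i + 0.5) * h / out_h - h / 2.0
            # Rotate into the feature-map frame and translate.
            x = cx + lx * cos_t - ly * sin_t
            y = cy + lx * sin_t + ly * cos_t
            row.append(bilinear(feat, x, y))
        out.append(row)
    return out


if __name__ == "__main__":
    # Feature map whose value equals the column index.
    feat = [[float(c) for c in range(5)] for _ in range(5)]
    print(rotated_roi_align(feat, (2.0, 2.0, 2.0, 2.0, 0.0), 2, 2))
```

Because sampling happens in the RoI's own frame, the pooled grid stays aligned with the object's orientation rather than with the image axes.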

License

This project is released under the Apache 2.0 license.

Benchmark and model zoo

Results are available in the Model zoo.
You can find the detailed configs in configs/DOTA.
The trained models are available at Google Drive or Baidu Drive.

Installation

Please refer to INSTALL.md for installation.

Get Started

Please see GETTING_STARTED.md for the basic usage of mmdetection.

Contributing

We appreciate all contributions to improve benchmarks for object detection in aerial images.

Citing

If you use the DOTA dataset, codebase, or models in your research, please consider citing:

@misc{ding2021object,
      title={Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges}, 
      author={Jian Ding and Nan Xue and Gui-Song Xia and Xiang Bai and Wen Yang and Michael Ying Yang and Serge Belongie and Jiebo Luo and Mihai Datcu and Marcello Pelillo and Liangpei Zhang},
      year={2021},
      eprint={2102.12219},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
@inproceedings{xia2018dota,
  title={DOTA: A large-scale dataset for object detection in aerial images},
  author={Xia, Gui-Song and Bai, Xiang and Ding, Jian and Zhu, Zhen and Belongie, Serge and Luo, Jiebo and Datcu, Mihai and Pelillo, Marcello and Zhang, Liangpei},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={3974--3983},
  year={2018}
}

@article{chen2019mmdetection,
  title={MMDetection: Open mmlab detection toolbox and benchmark},
  author={Chen, Kai and Wang, Jiaqi and Pang, Jiangmiao and Cao, Yuhang and Xiong, Yu and Li, Xiaoxiao and Sun, Shuyang and Feng, Wansen and Liu, Ziwei and Xu, Jiarui and others},
  journal={arXiv preprint arXiv:1906.07155},
  year={2019}
}

@InProceedings{Ding_2019_CVPR,
  author={Ding, Jian and Xue, Nan and Long, Yang and Xia, Gui-Song and Lu, Qikai},
  title={Learning RoI Transformer for Oriented Object Detection in Aerial Images},
  booktitle={The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  month={June},
  year={2019}
}

Thanks to the Third Party Libs

PyTorch

mmdetection

Owner

Jian Ding
