SwinTransformer + OBBDet

The sixth place winning solution (6/220) in the track of Fine-grained Object Recognition in High-Resolution Optical Images, 2021 Gaofen Challenge on Automated High-Resolution Earth Observation Image Interpretation.

Members

Qi Ming, Junjie Song, Yunpeng Dong.

Solution

Off-line date augmentation
We use random combination of affine transformation, flip, scaling, optical distortion for data augmentation.
Multi-scale training and testing
The training images are resized into sizes of 600, 800, and 1024 for training and testing.
Strong backbone
Swin transformer is adopt in ORCNN and RoI Transformer for better performance.
Model ensemble
We have merged the results from RoI Transformer, ORCNN, S2ANet, and ReDet.
Lower confidence
Set the output threshold into 0.005.

Tried but didn't work

Soft-NMS.
Adjust NMS threshold.
Class-agnostic NMS.
Mosaic, and mix up for data augmentation.
Oversample the categories with fewer instances.
Train the detectors for specific classes with low AP.
Multi-scale training and testing on SwinTransformer-based detectors (even dropped by about 1% mAP).

The sixth place winning solution (6/220) in 2021 Gaofen Challenge.

Related tags

Overview

SwinTransformer + OBBDet

Members

Solution

Tried but didn't work

Detections

Owner

ming71

Pytorch implementation of the DeepDream computer vision algorithm

Lightwood is Legos for Machine Learning.

General purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends)

Reverse engineer your pytorch vision models, in style

Mix3D: Out-of-Context Data Augmentation for 3D Scenes (3DV 2021)

Convert game ISO and archives to CD CHD for emulation on Linux.

SCALoss: Side and Corner Aligned Loss for Bounding Box Regression (AAAI2022).

A Partition Filter Network for Joint Entity and Relation Extraction EMNLP 2021

SSD-based Object Detection in PyTorch

Code for our CVPR2021 paper coordinate attention

A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

OpenMMLab Model Deployment Toolset

Writeups for the challenges from DownUnderCTF 2021

SeqTR: A Simple yet Universal Network for Visual Grounding

PyTorch implementation of SmoothGrad: removing noise by adding noise.

Omnidirectional camera calibration in python

ML-based medical imaging using Azure

Programming with Neural Surrogates of Programs

The repo contains the code to train and evaluate a system which extracts relations and explanations from dialogue.

[NeurIPS 2021] Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data