Official PyTorch Implementation of Mask-aware IoU and maYOLACT Detector [BMVC2021]

Last update: Sep 29, 2022

Related tags

Overview

The official implementation of Mask-aware IoU and maYOLACT detector. Our implementation is based on mmdetection.

Mask-aware IoU for Anchor Assignment in Real-time Instance Segmentation,
Kemal Oksuz, Baris Can Cam, Fehmi Kahraman, Zeynep Sonat Baltaci, Emre Akbas, Sinan Kalkan, BMVC 2021. (arXiv pre-print)

Summary

Mask-aware IoU: Mask-aware IoU (maIoU) is an IoU variant for better anchor assignment to supervise instance segmentation methods. Unlike the standard IoU, Mask-aware IoU also considers the ground truth masks while assigning a proximity score for an anchor. As a result, for example, if an anchor box overlaps with a ground truth box, but not with the mask of the ground truth, e.g. due to occlusion, then it has a lower score compared to IoU. Please check out the examples below for more insight. Replacing IoU by our maIoU in the state of the art ATSS assigner yields both performance improvement and efficiency (i.e. faster inference) compared to the standard YOLACT method.

maYOLACT Detector: Thanks to the efficiency due to ATSS with maIoU assigner, we incorporate more training tricks into YOLACT, and built maYOLACT Detector which is still real-time but significantly powerful (around 6 AP) than YOLACT. Our best maYOLACT model reaches SOTA performance by 37.7 mask AP on COCO test-dev at 25 fps.

How to Cite

Please cite the paper if you benefit from our paper or the repository:

@inproceedings{maIoU,
       title = {Mask-aware IoU for Anchor Assignment in Real-time Instance Segmentation},
       author = {Kemal Oksuz and Baris Can Cam and Fehmi Kahraman and Zeynep Sonat Baltaci and Sinan Kalkan and Emre Akbas},
       booktitle = {The British Machine Vision Conference (BMCV)},
       year = {2021}
}

Specification of Dependencies and Preparation

Please see get_started.md for requirements and installation of mmdetection.
Please refer to introduction.md for dataset preparation and basic usage of mmdetection.

Trained Models

Here, we report results in terms of AP (higher better) and oLRP (lower better).

Multi-stage Object Detection

Comparison of Different Assigners (on COCO minival)

Scale	Assigner	mask AP	mask oLRP	Log	Config	Model
400	Fixed IoU	24.8	78.3	log	config	model
400	ATSS w. IoU	25.3	77.7	log	config	model
400	ATSS w. maIoU	26.1	77.1	log	config	model
550	Fixed IoU	28.5	75.2	log	config	model
550	ATSS w. IoU	29.3	74.5	log	config	model
550	ATSS w. maIoU	30.4	73.7	log	config	model
700	Fixed IoU	29.7	74.3	log	config	model
700	ATSS w. IoU	30.8	73.3	log	config	model
700	ATSS w. maIoU	31.8	72.5	log	config	model

maYOLACT Detector (on COCO test-dev)

Scale	Backbone	mask AP	fps	Log	Config	Model
maYOLACT-550	ResNet-50	35.2	30	Coming Soon
maYOLACT-700	ResNet-50	37.7	25	Coming Soon

Running the Code

Training Code

The configuration files of all models listed above can be found in the configs/mayolact folder. You can follow get_started.md for training code. As an example, to train maYOLACT using images with 550 scale on 4 GPUs as we did, use the following command:

./tools/dist_train.sh configs/mayolact/mayolact_r50_4x8_coco_scale550.py 4

Test Code

The configuration files of all models listed above can be found in the configs/mayolact folder. You can follow get_started.md for test code. As an example, first download a trained model using the links provided in the tables below or you train a model, then run the following command to test a model model on multiple GPUs:

./tools/dist_test.sh configs/mayolact/mayolact_r50_4x8_coco_scale550.py ${CHECKPOINT_FILE} 4 --eval bbox segm

You can also test a model on a single GPU with the following example command:

python tools/test.py configs/mayolact/mayolact_r50_4x8_coco_scale550.py ${CHECKPOINT_FILE} --eval bbox segm

Official PyTorch Implementation of Mask-aware IoU and maYOLACT Detector [BMVC2021]

Related tags

Overview

Summary

How to Cite

Specification of Dependencies and Preparation

Trained Models

Multi-stage Object Detection

Comparison of Different Assigners (on COCO minival)

maYOLACT Detector (on COCO test-dev)

Running the Code

Training Code

Test Code

Owner

Kemal Oksuz

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise Weight Sharing) by Sensetime Research.

Build fully-functioning computer vision models with PyTorch

🍷 Gracefully claim weekly free games and monthly content from Epic Store.

ViSER: Video-Specific Surface Embeddings for Articulated 3D Shape Reconstruction

Wav2Vec for speech recognition, classification, and audio classification

MetaBalance: High-Performance Neural Networks for Class-Imbalanced Data

Computationally efficient algorithm that identifies boundary points of a point cloud.

Companion repo of the UCC 2021 paper "Predictive Auto-scaling with OpenStack Monasca"

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

Transformer in Vision

《Deep Single Portrait Image Relighting》(ICCV 2019)

Robust fine-tuning of zero-shot models

An introduction to satellite image analysis using Python + OpenCV and JavaScript + Google Earth Engine

Label Studio is a multi-type data labeling and annotation tool with standardized output format

A curated list of long-tailed recognition resources.

This is the code of "Multi-view Contrastive Graph Clustering" in NeurlPS 2021.

The code uses SegFormer for Semantic Segmentation on Drone Dataset.

The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.

EgoNN: Egocentric Neural Network for Point Cloud Based 6DoF Relocalization at the City Scale

Show-attend-and-tell - TensorFlow Implementation of "Show, Attend and Tell"