Official PyTorch Implementation of Mask-aware IoU and maYOLACT Detector [BMVC2021]

Last update: Sep 29, 2022

Related tags

Overview

The official implementation of Mask-aware IoU and maYOLACT detector. Our implementation is based on mmdetection.

Mask-aware IoU for Anchor Assignment in Real-time Instance Segmentation,
Kemal Oksuz, Baris Can Cam, Fehmi Kahraman, Zeynep Sonat Baltaci, Emre Akbas, Sinan Kalkan, BMVC 2021. (arXiv pre-print)

Summary

Mask-aware IoU: Mask-aware IoU (maIoU) is an IoU variant for better anchor assignment to supervise instance segmentation methods. Unlike the standard IoU, Mask-aware IoU also considers the ground truth masks while assigning a proximity score for an anchor. As a result, for example, if an anchor box overlaps with a ground truth box, but not with the mask of the ground truth, e.g. due to occlusion, then it has a lower score compared to IoU. Please check out the examples below for more insight. Replacing IoU by our maIoU in the state of the art ATSS assigner yields both performance improvement and efficiency (i.e. faster inference) compared to the standard YOLACT method.

maYOLACT Detector: Thanks to the efficiency due to ATSS with maIoU assigner, we incorporate more training tricks into YOLACT, and built maYOLACT Detector which is still real-time but significantly powerful (around 6 AP) than YOLACT. Our best maYOLACT model reaches SOTA performance by 37.7 mask AP on COCO test-dev at 25 fps.

How to Cite

Please cite the paper if you benefit from our paper or the repository:

@inproceedings{maIoU,
       title = {Mask-aware IoU for Anchor Assignment in Real-time Instance Segmentation},
       author = {Kemal Oksuz and Baris Can Cam and Fehmi Kahraman and Zeynep Sonat Baltaci and Sinan Kalkan and Emre Akbas},
       booktitle = {The British Machine Vision Conference (BMCV)},
       year = {2021}
}

Specification of Dependencies and Preparation

Please see get_started.md for requirements and installation of mmdetection.
Please refer to introduction.md for dataset preparation and basic usage of mmdetection.

Trained Models

Here, we report results in terms of AP (higher better) and oLRP (lower better).

Multi-stage Object Detection

Comparison of Different Assigners (on COCO minival)

Scale	Assigner	mask AP	mask oLRP	Log	Config	Model
400	Fixed IoU	24.8	78.3	log	config	model
400	ATSS w. IoU	25.3	77.7	log	config	model
400	ATSS w. maIoU	26.1	77.1	log	config	model
550	Fixed IoU	28.5	75.2	log	config	model
550	ATSS w. IoU	29.3	74.5	log	config	model
550	ATSS w. maIoU	30.4	73.7	log	config	model
700	Fixed IoU	29.7	74.3	log	config	model
700	ATSS w. IoU	30.8	73.3	log	config	model
700	ATSS w. maIoU	31.8	72.5	log	config	model

maYOLACT Detector (on COCO test-dev)

Scale	Backbone	mask AP	fps	Log	Config	Model
maYOLACT-550	ResNet-50	35.2	30	Coming Soon
maYOLACT-700	ResNet-50	37.7	25	Coming Soon

Running the Code

Training Code

The configuration files of all models listed above can be found in the configs/mayolact folder. You can follow get_started.md for training code. As an example, to train maYOLACT using images with 550 scale on 4 GPUs as we did, use the following command:

./tools/dist_train.sh configs/mayolact/mayolact_r50_4x8_coco_scale550.py 4

Test Code

The configuration files of all models listed above can be found in the configs/mayolact folder. You can follow get_started.md for test code. As an example, first download a trained model using the links provided in the tables below or you train a model, then run the following command to test a model model on multiple GPUs:

./tools/dist_test.sh configs/mayolact/mayolact_r50_4x8_coco_scale550.py ${CHECKPOINT_FILE} 4 --eval bbox segm

You can also test a model on a single GPU with the following example command:

python tools/test.py configs/mayolact/mayolact_r50_4x8_coco_scale550.py ${CHECKPOINT_FILE} --eval bbox segm

Official PyTorch Implementation of Mask-aware IoU and maYOLACT Detector [BMVC2021]

Related tags

Overview

Summary

How to Cite

Specification of Dependencies and Preparation

Trained Models

Multi-stage Object Detection

Comparison of Different Assigners (on COCO minival)

maYOLACT Detector (on COCO test-dev)

Running the Code

Training Code

Test Code

Owner

Kemal Oksuz

《Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching》(CVPR 2020)

U-Net for GBM

Modified prey-predator system - Modified prey–predator model describes the rate of change for each species by adding coupling terms.

🔅 Shapash makes Machine Learning models transparent and understandable by everyone

2021 National Underwater Robotics Vision Optics

Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight)

Unsupervised Image-to-Image Translation

This repository contains the code for TABS, a 3D CNN-Transformer hybrid automated brain tissue segmentation algorithm using T1w structural MRI scans

Guided Internet-delivered Cognitive Behavioral Therapy Adherence Forecasting

Invariant Causal Prediction for Block MDPs

Simple-Image-Classification - Simple Image Classification Code (PyTorch)

Weakly Supervised Segmentation by Tensorflow.

E2EDNA2 - An automated pipeline for simulation of DNA aptamers complexed with small molecules and short peptides

This is the implementation of the paper "Self-supervised Outdoor Scene Relighting"

Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources

RLMeta is a light-weight flexible framework for Distributed Reinforcement Learning Research.

UNet model with VGG11 encoder pre-trained on Kaggle Carvana dataset

Python scripts form performing stereo depth estimation using the CoEx model in ONNX.

Official code release for "GRAF: Generative Radiance Fields for 3D-Aware Image Synthesis"

hipCaffe: the HIP port of Caffe