RefineMask (CVPR 2021)

Last update: Jan 07, 2023

Overview

RefineMask: Towards High-Quality Instance Segmentation with Fine-Grained Features (CVPR 2021)

This repo is the official implementation of RefineMask: Towards High-Quality Instance Segmentation with Fine-Grained Features.

Framework

Main Results

Results on COCO

Method	Backbone	Schedule	AP	AP^*	Checkpoint
Mask R-CNN	R50-FPN	1x	34.7	36.8
RefineMask	R50-FPN	1x	37.3	40.6	download
Mask R-CNN	R50-FPN	2x	35.4	37.7
RefineMask	R50-FPN	2x	37.8	41.2	download
Mask R-CNN	R101-FPN	1x	36.1	38.4
RefineMask	R101-FPN	1x	38.6	41.8	download
Mask R-CNN	R101-FPN	2x	36.6	39.3
RefineMask	R101-FPN	2x	39.0	42.4	download

Note: No data augmentation other than standard horizontal flipping was used.

Results on LVIS

Method	Backbone	Schedule	AP	AP_r	AP_c	AP_f	Checkpoint
Mask R-CNN	R50-FPN	1x	22.1	10.1	21.7	30.0
RefineMask	R50-FPN	1x	25.7	13.8	24.9	31.8	download
Mask R-CNN	R101-FPN	1x	23.7	12.3	23.2	29.1
RefineMask	R101-FPN	1x	27.1	15.6	26.2	33.1	download

Results on Cityscapes

Method	Backbone	Schedule	AP	AP_S	AP_M	AP_L	Checkpoint
Mask R-CNN	R50-FPN	1x	33.8	12.0	31.5	51.8
RefineMask	R50-FPN	1x	37.6	14.0	35.4	57.9	download

Efficiency of RefineMask

Method	AP	AP^*	FPS
Mask R-CNN	34.7	36.8	15.7
PointRend	35.6	38.7	11.4
HTC	37.4	40.7	4.4
RefineMask	37.3	40.9	11.4
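
To make the speed/accuracy trade-off in the table above easier to read, the FPS figures can be converted to per-image latency and compared against the Mask R-CNN baseline. The snippet below is a small sketch that uses only the numbers from the table; the helper name `summarize` is ours and is not part of this repository.

```python
# Numbers copied from the efficiency table above: (AP, AP^*, FPS).
RESULTS = {
    "Mask R-CNN": (34.7, 36.8, 15.7),
    "PointRend": (35.6, 38.7, 11.4),
    "HTC": (37.4, 40.7, 4.4),
    "RefineMask": (37.3, 40.9, 11.4),
}


def summarize(results, baseline="Mask R-CNN"):
    """Return each method's AP gain over the baseline and its latency in ms per image."""
    base_ap = results[baseline][0]
    summary = {}
    for method, (ap, _ap_star, fps) in results.items():
        summary[method] = {
            "ap_gain": round(ap - base_ap, 1),
            "latency_ms": round(1000.0 / fps, 1),
        }
    return summary


if __name__ == "__main__":
    for method, row in summarize(RESULTS).items():
        print(f"{method:<11} AP gain {row['ap_gain']:+.1f}  latency {row['latency_ms']:.1f} ms")
```

On these numbers RefineMask gains 2.6 AP over Mask R-CNN at roughly 87.7 ms per image instead of 63.7 ms, and runs at the same speed as PointRend.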

Usage

Requirements

Python 3.6+
PyTorch 1.5.0
mmcv-full 1.0.5

Datasets

data
  ├── coco
  │   ├── annotations
  │   │   ├── instances_train2017.json
  │   │   ├── instances_val2017.json
  │   │   ├── lvis_v0.5_val_cocofied.json
  │   ├── train2017
  │   │   ├── 000000004134.png
  │   │   ├── 000000031817.png
  │   │   ├── ......
  │   ├── val2017
  │   ├── test2017
  ├── lvis
  │   ├── annotations
  │   │   ├── lvis_v1_train.json
  │   │   ├── lvis_v1_val.json
  │   ├── train2017
  │   │   ├── 000000004134.png
  │   │   ├── 000000031817.png
  │   │   ├── ......
  │   ├── val2017
  │   ├── test2017
  ├── cityscapes
  │   ├── annotations
  │   │   ├── instancesonly_filtered_gtFine_train.json
  │   │   ├── instancesonly_filtered_gtFine_val.json
  │   ├── leftImg8bit
  │   │   ├── train
  │   │   ├── val
  │   │   ├── test

Note: We used the LVIS v1.0 dataset, which consists of 1203 categories.
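
Before launching a job it can help to confirm that the data directory matches the layout above. The following is a minimal sketch, not part of this repository: the `EXPECTED` table and the `missing_paths` helper are hypothetical names, and the paths are simply transcribed from the tree in this section.

```python
from pathlib import Path

# Relative paths transcribed from the directory tree above (illustrative only).
EXPECTED = {
    "coco": [
        "annotations/instances_train2017.json",
        "annotations/instances_val2017.json",
        "annotations/lvis_v0.5_val_cocofied.json",
        "train2017",
        "val2017",
        "test2017",
    ],
    "lvis": [
        "annotations/lvis_v1_train.json",
        "annotations/lvis_v1_val.json",
        "train2017",
        "val2017",
        "test2017",
    ],
    "cityscapes": [
        "annotations/instancesonly_filtered_gtFine_train.json",
        "annotations/instancesonly_filtered_gtFine_val.json",
        "leftImg8bit/train",
        "leftImg8bit/val",
        "leftImg8bit/test",
    ],
}


def missing_paths(data_root, datasets=None):
    """List expected files or folders that are absent under data_root."""
    root = Path(data_root)
    names = list(EXPECTED) if datasets is None else datasets
    missing = []
    for name in names:
        for rel in EXPECTED[name]:
            path = root / name / rel
            if not path.exists():
                missing.append(path.relative_to(root).as_posix())
    return missing


if __name__ == "__main__":
    for rel in missing_paths("data"):
        print(f"missing: data/{rel}")
```

Passing a subset such as `missing_paths("data", ["coco"])` checks only the datasets you intend to train on.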

Training

./scripts/dist_train.sh ./configs/refinemask/coco/r50-refinemask-1x.py 8

Note: The code currently supports only a batch size of 1 per GPU. We trained all models with a total batch size of 16 (16x1); training with a total batch size of 8 (8x1) may reduce performance. Support for a batch size of 2 or more per GPU is planned. For multi-node training, use ./scripts/slurm_train.sh.
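
The note above does not say how to compensate when fewer GPUs are available. One common heuristic, which this README does not prescribe and which may not recover the reported numbers, is the linear scaling rule: scale the learning rate in proportion to the total batch size. A minimal sketch follows; the base learning rate of 0.02 is a hypothetical value used for illustration, so check the config file for the real one.

```python
def total_batch_size(num_gpus, samples_per_gpu=1):
    """Total batch size across all GPUs (this repo uses 1 sample per GPU)."""
    return num_gpus * samples_per_gpu


def scaled_lr(base_lr, base_batch, new_batch):
    """Linear scaling rule: the learning rate changes proportionally with batch size."""
    return base_lr * new_batch / base_batch


if __name__ == "__main__":
    base_lr = 0.02  # hypothetical value, not read from the repo's configs
    reference = total_batch_size(16)  # the 16x1 setting used for the reported results
    smaller = total_batch_size(8)     # the 8x1 setting mentioned in the note
    print(f"lr for batch {smaller}: {scaled_lr(base_lr, reference, smaller):.4f}")
```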

Inference

./scripts/dist_test.sh ./configs/refinemask/coco/r50-refinemask-1x.py xxxx.pth 8

Citation

@article{zhang2021refinemask,
  title={RefineMask: Towards High-Quality Instance Segmentation with Fine-Grained Features},
  author={Zhang, Gang and Lu, Xin and Tan, Jingru and Li, Jianmin and Zhang, Zhaoxiang and Li, Quanquan and Hu, Xiaolin},
  journal={arXiv preprint arXiv:2104.08569},
  year={2021}
}

Owner

Gang Zhang