Implementation of the paper "Implicit Feature Refinement for Instance Segmentation".

Last update: Dec 28, 2022

Implicit Feature Refinement for Instance Segmentation

This repository is an official implementation of the ACM Multimedia 2021 paper Implicit Feature Refinement for Instance Segmentation.

Introduction

TL;DR: Implicit feature refinement (IFR) has several advantages: 1) it simulates an infinite-depth refinement network while requiring only the parameters of a single residual block; 2) it produces high-level equilibrium instance features with a global receptive field; 3) it serves as a general plug-and-play module that is easily extended to most object recognition frameworks.
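
The "infinite-depth" and "equilibrium" ideas above can be illustrated with a toy fixed-point iteration. This is a minimal sketch, not the repository's implementation: the update `relu(W z + x)` merely stands in for the residual block, the names `block`, `unroll`, and `solve_equilibrium` and the weights are hypothetical, and plain forward iteration is assumed in place of whatever solver IFR actually uses.

```python
def block(W, z, x):
    """One weight-tied refinement step: z <- relu(W @ z + x)."""
    return [max(0.0, sum(w * zj for w, zj in zip(row, z)) + xi)
            for row, xi in zip(W, x)]


def unroll(W, x, depth):
    """Explicit form: apply the same block `depth` times, starting from zeros."""
    z = [0.0] * len(x)
    for _ in range(depth):
        z = block(W, z, x)
    return z


def solve_equilibrium(W, x, tol=1e-8, max_iter=500):
    """Implicit form: iterate until z stops changing, i.e. z* = block(W, z*, x)."""
    z = [0.0] * len(x)
    for step in range(1, max_iter + 1):
        z_next = block(W, z, x)
        residual = max(abs(a - b) for a, b in zip(z_next, z))
        z = z_next
        if residual < tol:
            break
    return z, step


# Hypothetical toy weights: absolute row sums are below 1, so the update contracts.
W = [[0.2, -0.1], [0.05, 0.3]]
x = [1.0, 0.5]  # stands in for the input instance feature

z_star, steps = solve_equilibrium(W, x)
for depth in (1, 4, 16):
    gap = max(abs(a - b) for a, b in zip(unroll(W, x, depth), z_star))
    print(f"depth={depth:2d}  gap to equilibrium={gap:.2e}")
```

In this toy setting, deeper unrolling of the one shared block moves closer to the equilibrium `z_star`, which is the sense in which a single block's parameters can mimic a very deep refinement stack.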

Get Started

Install cvpods by following its installation instructions:

# Install cvpods
git clone https://github.com/Megvii-BaseDetection/cvpods.git
cd cvpods
## build cvpods (requires GPU)
python3 setup.py build develop
## prepare data path
mkdir datasets
ln -s /path/to/your/coco/dataset datasets/coco

To save training and testing time, we also provide the explicit form of IFR on mask_rcnn (marked "weight_sharing"), which achieves competitive performance.
For fast evaluation, please download the trained model from here.

Run the project

git clone https://github.com/lufanma/IFR.git

# for example, mask_rcnn.ifr
cd IFR/mask_rcnn.ifr.res50.fpn.coco.multiscale.1x/

# train
sh pods_train.sh

# test
sh pods_test.sh
# test with provided weights (both overrides are optional)
sh pods_test.sh \
    MODEL.WEIGHTS /path/to/your/save_dir/ckpt.pth \
    OUTPUT_DIR /path/to/your/save_dir

Results

Model	AP	AP50	AP75	APs	APm	APl	Link
mask_rcnn.ifr.res50.fpn.coco.multiscale.1x	36.3	56.8	39.2	17.3	39.0	52.2	download
mask_rcnn.res50.fpn.coco.multiscale.weight_sharing.1x	35.9	56.7	38.5	17.1	38.5	51.8	download
cascade_rcnn.ifr.res50.fpn.coco.800size.1x	36.9	57.1	39.8	17.4	39.3	54.6	download

Citing IFR

If you find IFR useful in your research, please consider citing:

@inproceedings{ma2021implicit,
  title={Implicit Feature Refinement for Instance Segmentation},
  author={Ma, Lufan and Wang, Tiancai and Dong, Bin and Yan, Jiangpeng and Li, Xiu and Zhang, Xiangyu},
  booktitle={Proceedings of the 29th ACM International Conference on Multimedia},
  pages={3088--3096},
  year={2021}
}

IFR is built on the open-source DEQ and MDEQ projects; we thank their authors.


Owner

Lufan Ma
