SOTR: Segmenting Objects with Transformers [ICCV 2021]

Last update: Dec 20, 2022

Related tags

Deep Learning SOTR

Overview

SOTR: Segmenting Objects with Transformers [ICCV 2021]

By Ruohao Guo, Dantong Niu, Liao Qu, Zhenbo Li

Introduction

This is the official implementation of SOTR.

Models

COCO Instance Segmentation Baselines with SOTR

Name	mask AP	AP_S	AP_M	AP_L	download
SOTR_R101	40.2	10.2	59.0	73.1	model
SOTR_R101_DCN	42.0	11.4	60.7	74.5	model

Installation & Quick start

First install Detectron2 following the official guide: INSTALL.md.
Then build SOTR with:

https://github.com/easton-cau/SOTR
cd SOTR
python setup.py build develop

Then follow datasets/README.md to set up the datasets (e.g., MS-COCO).

Evaluating

Download the trained models for COCO.

Run the following command

python tools/train_net.py \
    --config-file configs/SOTR/R101.yaml \
    --eval-only \
    --num-gpus 4 \
    MODEL.WEIGHTS work_dir/SOTR_R101/SOTR_R101.pth

Training

Run the following command

python tools/train_net.py \
    --config-file configs/SOTR/R101.yaml \
    --num-gpus 4 \

Acknowledgement

Thanks Detectron2 and AdelaiDet contribution to the community!

The work is supported by National Key R&D Program of China (2020YFD0900204) and Key-Area Research and Development Program of Guangdong Province China (2020B0202010009).

FAQ

If you want to improve the usability or any piece of advice, please feel free to contant directly ([email protected]).

Citation

Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follow.

@misc{guo2021sotr,
      title={SOTR: Segmenting Objects with Transformers}, 
      author={Ruohao Guo and Dantong Niu and Liao Qu and Zhenbo Li},
      year={2021},
      eprint={2108.06747},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

SOTR: Segmenting Objects with Transformers [ICCV 2021]

Related tags

Overview

SOTR: Segmenting Objects with Transformers [ICCV 2021]

Introduction

Models

COCO Instance Segmentation Baselines with SOTR

Installation & Quick start

Acknowledgement

FAQ

Citation

Owner

Semi-supervised Domain Adaptation via Minimax Entropy

Code for the paper "A Study of Face Obfuscation in ImageNet"

Stacked Hourglass Network with a Multi-level Attention Mechanism: Where to Look for Intervertebral Disc Labeling

This repository is the official implementation of Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning (NeurIPS21).

This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes dataset. This code is implemented in Pytorch.

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

NUANCED is a user-centric conversational recommendation dataset that contains 5.1k annotated dialogues and 26k high-quality user turns.

Causal Imitative Model for Autonomous Driving

NeoDTI: Neural integration of neighbor information from a heterogeneous network for discovering new drug-target interactions

Snscrape-jsonl-urls-extractor - Extracts urls from jsonl produced by snscrape

A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.

A curated list of awesome Model-Based RL resources

机器学习、深度学习、自然语言处理等人工智能基础知识总结。

PClean: A Domain-Specific Probabilistic Programming Language for Bayesian Data Cleaning

Dynamic Slimmable Network (CVPR 2021, Oral)

Accelerating BERT Inference for Sequence Labeling via Early-Exit

DIRL: Domain-Invariant Representation Learning

NeuroMorph: Unsupervised Shape Interpolation and Correspondence in One Go

OMNIVORE is a single vision model for many different visual modalities

The source code and data of the paper "Instance-wise Graph-based Framework for Multivariate Time Series Forecasting".