code for CVPR paper Zero-shot Instance Segmentation

Last update: Dec 13, 2022

Overview

Code for CVPR2021 paper

Zero-shot Instance Segmentation

Code requirements

python: python3.7
nvidia GPU
pytorch1.1.0
GCC >=5.4
NCCL 2
the other python libs in requirement.txt

Install

conda create -n zsi python=3.7 -y
conda activate zsi

conda install pytorch=1.1.0 torchvision=0.3.0 cudatoolkit=10.0 -c pytorch

pip install cython && pip --no-cache-dir install -r requirements.txt
   
python setup.py develop

Dataset prepare

Download the train and test annotations files for zsi from annotations, put all json label file to
```
data/coco/annotations/
```
Download MSCOCO-2014 dataset and unzip the images it to path：
```
data/coco/train2014/
data/coco/val2014/
```

Training:

48/17 split:

   chmod +x tools/dist_train.sh
   ./tools/dist_train.sh configs/zsi/train/zero-shot-mask-rcnn-BARPN-bbox_mask_sync_bg_decoder.py 4

65/15 split:

chmod +x tools/dist_train.sh
./tools/dist_train.sh configs/zsi/train/zero-shot-mask-rcnn-BARPN-bbox_mask_sync_bg_65_15_decoder_notanh.py 4

Inference & Evaluate:

ZSI task:

48/17 split ZSI task:

download 48/17 ZSI model, put it in checkpoints/ZSI_48_17.pth

inference:

chmod +x tools/dist_test.sh
./tools/dist_test.sh configs/zsi/48_17/test/zsi/zero-shot-mask-rcnn-BARPN-bbox_mask_sync_bg_decoder.py checkpoints/ZSI_48_17.pth 4 --json_out results/zsi_48_17.json

our results zsi_48_17.bbox.json and zsi_48_17.segm.json can also downloaded from zsi_48_17_reults.

evaluate:

for zsd performance

python tools/zsi_coco_eval.py results/zsi_48_17.bbox.json --ann data/coco/annotations/instances_val2014_unseen_48_17.json

for zsi performance

python tools/zsi_coco_eval.py results/zsi_48_17.segm.json --ann data/coco/annotations/instances_val2014_unseen_48_17.json --types segm

65/15 split ZSI task:

download 65/15 ZSI model, put it in checkpoints/ZSI_65_15.pth

inference:

chmod +x tools/dist_test.sh
./toools/dist_test.sh configs/zsi/65_15/test/zsi/zero-shot-mask-rcnn-BARPN-bbox_mask_sync_bg_65_15_decoder_notanh.py checkpoints/ZSI_65_15.pth 4 --json_out results/zsi_65_15.json

our results zsi_65_15.bbox.json and zsi_65_15.segm.json can also downloaded from zsi_65_15_reults.

evaluate:

for zsd performance

python tools/zsi_coco_eval.py results/zsi_65_15.bbox.json --ann data/coco/annotations/instances_val2014_unseen_65_15.json

for zsi performance

python tools/zsi_coco_eval.py results/zsi_65_15.segm.json --ann data/coco/annotations/instances_val2014_unseen_65_15.json --types segm

GZSI task:

48/17 split GZSI task:

use the same model file ZSI_48_17.pth in ZSI task

inference:

chmod +x tools/dist_test.sh
./tools/dist_test.sh configs/zsi/48_17/test/gzsi/zero-shot-mask-rcnn-BARPN-bbox_mask_sync_bg_decoder_gzsi.py checkpoints/ZSI_48_17.pth 4 --json_out results/gzsi_48_17.json

our results gzsi_48_17.bbox.json and gzsi_48_17.segm.json can also downloaded from gzsi_48_17_results.

evaluate:

for gzsd

python tools/gzsi_coco_eval.py results/gzsi_48_17.bbox.json --ann data/coco/annotations/instances_val2014_gzsi_48_17.json --gzsi --num-seen-classes 48

for gzsi

python tools/gzsi_coco_eval.py results/gzsi_48_17.segm.json --ann data/coco/annotations/instances_val2014_gzsi_48_17.json --gzsi --num-seen-classes 48 --types segm

65/15 split GZSI task:

use the same model file ZSI_48_17.pth in ZSI task

inference:

chmod +x tools/dist_test.sh
./tools/dist_test.sh configs/zsi/65_15/test/gzsi/zero-shot-mask-rcnn-BARPN-bbox_mask_sync_bg_65_15_decoder_notanh_gzsi.py checkpoints/ZSI_65_15.pth 4 --json_out results/gzsi_65_15.json

our results gzsi_65_15.bbox.json and gzsi_65_15.segm.json can also downloaded from gzsi_65_15_results.

evaluate:

for gzsd

python tools/gzsi_coco_eval.py results/gzsi_65_15.bbox.json --ann data/coco/annotations/instances_val2014_gzsi_65_15.json --gzsd --num-seen-classes 65

for gzsi

python tools/gzsi_coco_eval.py results/gzsi_65_15.segm.json --ann data/coco/annotations/instances_val2014_gzsi_65_15.json --gzsd --num-seen-classes 65 --types segm

License

ZSI is released under MIT License.

Citing

If you use ZSI in your research or wish to refer to the baseline results published here, please use the following BibTeX entries:

@InProceedings{zhengye2021zsi,
  author  =  {Ye, Zheng and Jiahong, Wu and Yongqiag, Qin and Faen, Zhang and Li, Cui},
  title   =  {Zero-shot Instance Segmentation},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  month = {June},
  year = {2021}
}

code for CVPR paper Zero-shot Instance Segmentation

Related tags

Overview

Code for CVPR2021 paper

Zero-shot Instance Segmentation

Code requirements

Install

Dataset prepare

License

Citing

Owner

zhengye

Pytorch implementation of RED-SDS (NeurIPS 2021).

A 35mm camera, based on the Canonet G-III QL17 rangefinder, simulated in Python.

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Probabilistic Programming and Statistical Inference in PyTorch

Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

Deploying PyTorch Model to Production with FastAPI in CUDA-supported Docker

Official codebase for Pretrained Transformers as Universal Computation Engines.

Official code for our ICCV paper: "From Continuity to Editability: Inverting GANs with Consecutive Images"

Scalable machine learning based time series forecasting

Implementation for "Conditional entropy minimization principle for learning domain invariant representation features"

Pytorch implementations of Bayes By Backprop, MC Dropout, SGLD, the Local Reparametrization Trick, KF-Laplace, SG-HMC and more

Code and experiments for "Deep Neural Networks for Rank Consistent Ordinal Regression based on Conditional Probabilities"

pixelNeRF: Neural Radiance Fields from One or Few Images

Head and Neck Tumour Segmentation and Prediction of Patient Survival Project

Code for the paper "On the Power of Edge Independent Graph Models"

Another pytorch implementation of FCN (Fully Convolutional Networks)

DeepStruc is a Conditional Variational Autoencoder which can predict the mono-metallic nanoparticle from a Pair Distribution Function.

PyTorch Implementation of Fully Convolutional Networks. (Training code to reproduce the original result is available.)

A TensorFlow implementation of the Mnemonic Descent Method.

Classification of ecg datas for disease detection