Learning to Segment Instances in Videos with Spatial Propagation Network

Last update: Sep 28, 2022

Overview

This paper is available at the 2017 DAVIS Challenge website.

Check our results in this video.

Contact: Jingchun Cheng (chengjingchun at gmail dot com)

Cite the Paper

If you find our method useful in your research, please cite:

@article{DAVIS2017-6th,
  author = {J. Cheng and S. Liu and Y.-H. Tsai and W.-C. Hung and S. Gupta and J. Gu and J. Kautz and S. Wang and M.-H. Yang}, 
  title = {Learning to Segment Instances in Videos with Spatial Propagation Network}, 
  journal = {The 2017 DAVIS Challenge on Video Object Segmentation - CVPR Workshops}, 
  year = {2017}
}

About the Code

The code released here mainly consists of two parts of the paper: foreground segmentation and instance recognition.
It contains the parent network for foreground segmentation and the training code for the instance recognition networks.
The matlab_code folder contains a simplified version of our CRAF step for segmentation refinement.

Requirements

Install Caffe and pycaffe (see http://caffe.berkeleyvision.org/).
Download the DAVIS 2017 dataset and put it in the data folder.
Download the pre-trained foreground/background model here and put it in the pretrained folder.
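
Before training or testing, it can help to confirm that the folders named above are in place. The following is a minimal sketch, not part of the released code: the helper name `missing_requirements` is hypothetical, and the model file name is taken from the example commands in this README.

```python
from pathlib import Path

# Folder names come from the Requirements list in this README.
REQUIRED_DIRS = ("data", "pretrained")
# Assumption: the pre-trained model keeps the file name used in the
# README example commands.
MODEL_FILE = "pretrained/PN_ResNetF.caffemodel"


def missing_requirements(repo_root):
    """Return the required folders/files that are absent under repo_root."""
    root = Path(repo_root)
    missing = [d for d in REQUIRED_DIRS if not (root / d).is_dir()]
    if not (root / MODEL_FILE).is_file():
        missing.append(MODEL_FILE)
    return missing
```

Calling `missing_requirements(".")` from the repository root returns an empty list when everything is in place.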

Training

Train the per-object recognition model.
cd training
python solve.py PATH_OF_MODEL PATH_OF_SOLVER
For example, on the 'choreography' video for the 1st object, run:
python solve.py ../pretrained/PN_ResNetF.caffemodel ../ResNetF/testnet_per_obj/choreography/solver_1.prototxt
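
Since one model is trained per object, the command above has to be repeated for every object in a video. The sketch below builds those commands. It assumes that every solver file follows the path pattern of the single example above (`solver_<id>.prototxt` inside a per-video folder), which may not hold for every video. The helper names and the `num_objects` parameter are hypothetical.

```python
# Assumption: both paths are inferred from the one README example and are
# relative to the training folder.
PRETRAINED_MODEL = "../pretrained/PN_ResNetF.caffemodel"
SOLVER_ROOT = "../ResNetF/testnet_per_obj"


def solver_path(video, object_id):
    """Guess the solver file for one object, following the example's pattern."""
    return f"{SOLVER_ROOT}/{video}/solver_{object_id}.prototxt"


def training_command(video, object_id, model=PRETRAINED_MODEL):
    """Build the solve.py argument list for one object of one video."""
    return ["python", "solve.py", model, solver_path(video, object_id)]


def training_commands(video, num_objects):
    """Build one training command per object, with object ids starting at 1."""
    return [training_command(video, i) for i in range(1, num_objects + 1)]
```

Each returned list can be passed to `subprocess.run(cmd, cwd="training", check=True)` to launch the corresponding training run.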

Testing

Test the general foreground/background model.
python infer_test_fgbg.py PATH_OF_MODEL PATH_OF_RESULT VIDEO_NAME
For example, on the 'lions' video, run:
python infer_test_fgbg.py pretrained/PN_ResNetF.caffemodel results/fgbg lions
Test the object instance model.
python infer_test_perobj.py MODEL_ITERATION VIDEO_NAME OBJECT_ID
For example, on the 'lions' video for the 2nd object, run:
python infer_test_perobj.py 3000 lions 2
Run example_CRAF.m in the matlab_code folder for a demo on CRAF segmentation refinement.

Download Our Segmentation Results on 2017 DAVIS Challenge

General foreground/background segmentation here
Instance-level object segmentation without refinement here
Final instance-level object segmentation with refinement here

Note

The model and code are available for non-commercial research purposes only.

09/2017: code and model released
03/2018: pre-trained model updated

Owner

Jingchun Cheng
