Learning to Segment Instances in Videos with Spatial Propagation Network

Last update: Sep 28, 2022

Overview

This paper is available at the 2017 DAVIS Challenge website.

Check our results in this video.

Contact: Jingchun Cheng (chengjingchun at gmail dot com)

Cite the Paper

If you find that our method is useful in your research, please cite:

@article{DAVIS2017-6th,
  author = {J. Cheng and S. Liu and Y.-H. Tsai and W.-C. Hung and S. Gupta and J. Gu and J. Kautz and S. Wang and M.-H. Yang}, 
  title = {Learning to Segment Instances in Videos with Spatial Propagation Network}, 
  journal = {The 2017 DAVIS Challenge on Video Object Segmentation - CVPR Workshops}, 
  year = {2017}
}

About the Code

The code released here consists mainly of two parts of the paper: foreground segmentation and instance recognition.
It contains the parent net for foreground segmentation and the training code for the instance recognition networks.
The matlab_code folder contains a simple version of our CRAF step for segmentation refinement.

Requirements

Install Caffe and pycaffe (see http://caffe.berkeleyvision.org/).
Download the DAVIS 2017 dataset and put it in the data folder.
Download the pre-trained foreground/background model here and put it in the pretrained folder.
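
As a quick sanity check before training or testing, the folder layout described above can be verified with a short script. This is only a sketch: the file name `PN_ResNetF.caffemodel` is taken from the example commands in this README, the helper `missing_paths` is hypothetical and not part of the released code, and the layout inside the data folder is not checked because it is not specified here.

```python
import os

# Paths mentioned in this README. Only the top-level data folder is checked,
# since the expected layout inside it is not specified here.
EXPECTED_PATHS = (
    "data",
    os.path.join("pretrained", "PN_ResNetF.caffemodel"),
)


def missing_paths(root=".", exists=os.path.exists):
    """Return the expected paths that are absent under `root`.

    `exists` is injectable so the check can be exercised without
    touching the filesystem. This helper is hypothetical.
    """
    return [p for p in EXPECTED_PATHS if not exists(os.path.join(root, p))]
```

Calling `missing_paths()` from the repository root should return an empty list once the dataset and the pre-trained model are in place.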

Training

Train the per-object recognition model.
cd training
python solve.py PATH_OF_MODEL PATH_OF_SOLVER
For example, on the 'choreography' video for the 1st object, run:
python solve.py ../pretrained/PN_ResNetF.caffemodel ../ResNetF/testnet_per_obj/choreography/solver_1.prototxt
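
Since one model is trained per object, a video with several objects needs one `solve.py` call per object. The sketch below builds those command lines. It assumes that the solver files for the other objects follow the same `solver_<id>.prototxt` naming as the example above, which this README does not state explicitly; the helper `build_train_commands` and the `num_objects` argument are hypothetical and must be supplied by the user.

```python
from pathlib import PurePosixPath


def build_train_commands(
    video,
    num_objects,
    model="../pretrained/PN_ResNetF.caffemodel",
    solver_root="../ResNetF/testnet_per_obj",
):
    """Build one `solve.py` command per object of a video.

    Assumes solver files are named solver_<id>.prototxt with ids starting
    at 1, as in the README example. This helper is hypothetical.
    """
    commands = []
    for obj_id in range(1, num_objects + 1):
        solver = PurePosixPath(solver_root) / video / f"solver_{obj_id}.prototxt"
        commands.append(["python", "solve.py", model, str(solver)])
    return commands
```

Each returned list can be passed to `subprocess.run(cmd, check=True, cwd="training")`, since the example paths are relative to the training folder.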

Testing

Test the general foreground/background model.
python infer_test_fgbg.py PATH_OF_MODEL PATH_OF_RESULT VIDEO_NAME
For example, on the 'lions' video, run:
python infer_test_fgbg.py pretrained/PN_ResNetF.caffemodel results/fgbg lions
Test the object instance model.
python infer_test_perobj.py MODEL_ITERATION VIDEO_NAME OBJECT_ID
For example, on the 'lions' video for the 2nd object, run:
python infer_test_perobj.py 3000 lions 2
Run example_CRAF.m in the matlab_code folder for a demo on CRAF segmentation refinement.
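
To run the object instance test for every object of a video, the per-object command above can be generated in a loop. This is a minimal sketch: the helper `build_perobj_test_commands` is hypothetical, the number of objects must be supplied by the user, and object ids are assumed to start at 1 as in the examples.

```python
def build_perobj_test_commands(video, num_objects, iteration):
    """Build one `infer_test_perobj.py` command per object of a video.

    Argument order follows the README:
    MODEL_ITERATION VIDEO_NAME OBJECT_ID. This helper is hypothetical.
    """
    return [
        ["python", "infer_test_perobj.py", str(iteration), video, str(obj_id)]
        for obj_id in range(1, num_objects + 1)
    ]
```

For instance, `build_perobj_test_commands("lions", 2, 3000)` yields the command from the example above as its second entry; each entry can be executed with `subprocess.run(cmd, check=True)`.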

Download Our Segmentation Results on 2017 DAVIS Challenge

General foreground/background segmentation here
Instance-level object segmentation without refinement here
Final instance-level object segmentation with refinement here

Note

The model and code are available for non-commercial research purposes only.

09/2017: code and model released
03/2018: pre-trained model updated

Owner

Jingchun Cheng
