implement of SwiftNet:Real-time Video Object Segmentation

Last update: Dec 14, 2022

Related tags

Overview

SwiftNet

The official PyTorch implementation of SwiftNet:Real-time Video Object Segmentation, which has been accepted by CVPR2021.

Requirements

Python >= 3.6
Pytorch 1.5
Numpy
Pillow
opencv-python
scipy
tqdm

Training

The training pipeline of Swiftnet is similar with the training pipeline of STM, which can be found in our reproduced STM training code.

Inference

Usage

python eval.py -g 0 -y 17 -s val -D 'path to davis'

Performance

Performance on Davis-17 val set.

backbone	J&F	J	F	FPS	weights
resnet-18	77.6	75.5	79.7	65	`link`

Note: The FPS is tested on one P100, which does not include the time of image loading and evaluation cost.

Acknowledgement

This repository is partially founded on the official STM repository.

Citation

If you find this repository helpful and want to cite SwiftNet in your own projects, please use the following citation info.

@inproceedings{wang2021swiftnet,
  title={SwiftNet: Real-time Video Object Segmentation},
  author={Wang, Haochen and Jiang, Xiaolong and Ren, Haibing and Hu, Yao and Bai, Song},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={1296--1305},
  year={2021}
}

implement of SwiftNet:Real-time Video Object Segmentation

Related tags

Overview

SwiftNet

Requirements

Training

Inference

Performance

Acknowledgement

Citation

Owner

haochen wang

Syntax-Aware Action Targeting for Video Captioning

Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.

A forwarding MPI implementation that can use any other MPI implementation via an MPI ABI

Speech Recognition using DeepSpeech2.

A self-supervised learning framework for audio-visual speech

Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning

BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond

A toy project using OpenCV and PyMunk

A full pipeline AutoML tool for tabular data

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

Lex Rosetta: Transfer of Predictive Models Across Languages, Jurisdictions, and Legal Domains

Repo for 2021 SDD assessment task 2, by Felix, Anna, and James.

High dimensional black-box optimizer using Latent Action Monte Carlo Tree Search algorithm

Implementation for Learning to Track with Object Permanence

:boar: :bear: Deep Learning based Python Library for Stock Market Prediction and Modelling

Code for paper "Learning to Reweight Examples for Robust Deep Learning"

Time series annotation library.

ACL'2021: LM-BFF: Better Few-shot Fine-tuning of Language Models

This is the official repository of the paper Stocastic bandits with groups of similar arms (NeurIPS 2021). It contains the code that was used to compute the figures and experiments of the paper.

[CVPR 2021] Forecasting the panoptic segmentation of future video frames