A Survey on Deep Learning Technique for Video Segmentation

Wenguan Wang, Tianfei Zhou, Fatih Porikli, David Crandall, and Luc Van Gool.

Contributing

Please feel free to create issues or pull requests to add papers.

Discussions on video segmentation are welcome at

1. Introduction

Video segmentation, i.e., partitioning video frames into multiple segments or objects, plays a critical role in a broad range of practical applications, from enhancing visual effects in movies, to understanding scenes in autonomous driving, to creating virtual backgrounds in video conferencing. In this survey, we comprehensively review two basic lines of research, video object segmentation (VOS) and video semantic segmentation (VSS), by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. In particular, we review eight sub-fields as given in the following figure:

2. Deep Learning-based Video Object Segmentation

3. Deep Learning-based Video Semantic Segmentation

4. Datasets

Popular Datasets in VOS and VSS

Citation

If you find our survey and repository useful for your research, please consider citing our paper:

@article{wang2021survey,
  title={A survey on deep learning technique for video segmentation},
  author={Wang, Wenguan and Zhou, Tianfei and Porikli, Fatih and Crandall, David and Van Gool, Luc},
  journal={arXiv preprint arXiv:2107.01153},
  year={2021}
}

