A PyTorch implementation of the CVPR 2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"

Last update: Nov 29, 2022

Overview

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild

Preparation

Download VSPW dataset

The VSPW dataset, with extracted frames and masks, is available here. The VSPW_480P version of the dataset can now be downloaded directly.

Dependencies

Python 3.7
PyTorch 1.3.1
Numpy

Download the ImageNet-pretrained models at this link. Put the downloaded archive in the repository root folder and decompress it there.

Train and Test

Resize the frames and masks of the VSPW dataset to 480p.

python change2_480p.py
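As a rough illustration of what resizing to 480p involves, the sketch below computes a target size whose shorter side is 480 pixels while keeping the aspect ratio. It is not the code in change2_480p.py: the function name `size_for_480p` and the "shorter side equals 480" convention are assumptions made for this example. Whatever resizing routine is used, masks are normally resized with nearest-neighbour interpolation so that no new label values are created by blending.

```python
def size_for_480p(width, height, short_side=480):
    """Return (new_width, new_height) with the shorter side scaled to short_side.

    Hypothetical helper for illustration; the aspect ratio is preserved
    and the longer side is rounded to the nearest pixel.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    shorter = min(width, height)
    # Multiply before dividing so the shorter side maps exactly to short_side.
    new_width = round(width * short_side / shorter)
    new_height = round(height * short_side / shorter)
    return new_width, new_height


if __name__ == "__main__":
    print(size_for_480p(1920, 1080))  # (853, 480)
    print(size_for_480p(1080, 1920))  # (480, 853)
```

The same target size should be applied to a frame and to its mask so that the two stay pixel-aligned.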

Edit the .sh files in scripts/ and set $DATAROOT to the path of your VSPW_480p folder.
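If you prefer to script this change rather than edit each file by hand, a minimal sketch could look like the following. It assumes that each script sets the variable on its own line in the form `DATAROOT=...` (optionally prefixed with `export`), which may not match the actual scripts; the helper name `set_dataroot` is hypothetical.

```python
import re

# Matches a whole line such as `DATAROOT=/old/path` or `export DATAROOT=/old/path`.
# Assumption: the scripts assign DATAROOT on a line of its own.
_DATAROOT_LINE = re.compile(r"^([ \t]*(?:export[ \t]+)?DATAROOT=).*$", re.MULTILINE)


def set_dataroot(script_text, new_root):
    """Return script_text with every DATAROOT assignment pointing to new_root."""
    new_text, count = _DATAROOT_LINE.subn(
        # A function replacement avoids backslash-escape surprises in new_root.
        lambda match: match.group(1) + '"' + new_root + '"',
        script_text,
    )
    if count == 0:
        raise ValueError("no DATAROOT assignment found")
    return new_text


if __name__ == "__main__":
    example = "DATAROOT=/old/path\necho $DATAROOT\n"  # made-up script content
    print(set_dataroot(example, "/data/VSPW_480p"))
```

The function only transforms text; reading each .sh file, applying it, and writing the result back is left to the caller, so the original scripts can be backed up first.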

Image-based methods

PSPNet

sh scripts/run_psp.sh

OCRNet

sh scripts/run_ocr.sh

Video-based methods

TCB-PSP

sh run_temporal_psp.sh

TCB-OCR

sh run_temporal_ocr.sh

Evaluation on TC and VC

Set the data root and the prediction root in TC_cal.py (temporal consistency, TC) and VC_perclip.py (video consistency, VC).

python TC_cal.py

python VC_perclip.py
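For intuition about the VC score, the sketch below computes one common reading of video consistency for a single clip: among the pixels whose ground-truth label stays the same in every frame of the clip, the fraction that is also predicted correctly in every frame. This is an illustrative NumPy sketch, not the code in VC_perclip.py; the function name `video_consistency`, the `(frames, height, width)` array layout, and the ignore label of 255 are assumptions.

```python
import numpy as np


def video_consistency(gt_clip, pred_clip, ignore_label=255):
    """Clip-level consistency score in [0, 1], or NaN if no pixel is stable.

    gt_clip, pred_clip: integer label arrays of shape (frames, height, width).
    ignore_label is an assumed "void" label that is excluded from the score.
    """
    gt_clip = np.asarray(gt_clip)
    pred_clip = np.asarray(pred_clip)
    if gt_clip.shape != pred_clip.shape:
        raise ValueError("ground truth and prediction must have the same shape")

    # Pixels whose ground-truth label is identical in every frame of the clip.
    stable = np.all(gt_clip == gt_clip[0], axis=0)
    stable &= gt_clip[0] != ignore_label
    n_stable = int(stable.sum())
    if n_stable == 0:
        return float("nan")

    # Stable pixels that are predicted correctly in every frame.
    correct = np.all(pred_clip == gt_clip, axis=0) & stable
    return float(correct.sum()) / n_stable


if __name__ == "__main__":
    gt = np.array([[[1, 1], [2, 3]], [[1, 1], [2, 4]]])
    pred = np.array([[[1, 1], [2, 3]], [[1, 0], [2, 4]]])
    print(video_consistency(gt, pred))  # 2 of the 3 stable pixels are consistent
```

A dataset-level score would then average this value over clips; how clips are sampled and averaged is defined by the evaluation scripts themselves.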

This implementation builds on this code and on RAFT.

Citation

@inproceedings{miao2021vspw,
  title={VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild},
  author={Miao, Jiaxu and Wei, Yunchao and Wu, Yu and Liang, Chen and Li, Guangrui and Yang, Yi},
  booktitle={Proceedings of the {IEEE} Conference on Computer Vision and Pattern Recognition},
  year={2021}
}
