Applying PVT to Semantic Segmentation

Last update: Nov 30, 2022

Related tags

Deep Learning PVTv2-Seg

Overview

Applying PVT to Semantic Segmentation

Here, we take MMSegmentation v0.13.0 as an example, applying PVTv2 to SemanticFPN.

For details see Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions.

If you use this code for a paper please cite:

@misc{wang2021pyramid,
      title={Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions}, 
      author={Wenhai Wang and Enze Xie and Xiang Li and Deng-Ping Fan and Kaitao Song and Ding Liang and Tong Lu and Ping Luo and Ling Shao},
      year={2021},
      eprint={2102.12122},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Usage

Install MMSegmentation.

Data preparation

First, prepare ADE20K according to the guidelines in MMSegmentation.

Then, download the weights pretrained on ImageNet at here, and put them in a folder pretrained/

Results and models

Backbone	Iters	mIoU	Config
PVTv2-B0 + Semantic FPN	40K	37.2	config
PVTv2-B1 + Semantic FPN	40K	42.5	config
PVTv2-B2 + Semantic FPN	40K	45.2	config
PVTv2-B3 + Semantic FPN	40K	47.3	config
PVTv2-B4 + Semantic FPN	40K	47.9	config
PVTv2-B5 + Semantic FPN	40K	48.7	config

Evaluation

To evaluate PVTv2-B2 + SemFPN on a single node with 8 gpus run:

dist_test.sh configs/sem_fpn/PVT/fpn_pvtv2_b2_ade20k_40k.py /path/to/checkpoint_file 8 --out results.pkl --eval mIoU

Training

To train PVTv2-B2 + SemFPN on a single node with 8 gpus run:

dist_train.sh configs/sem_fpn/PVT/fpn_pvtv2_b2_ade20k_40k.py 8

License

This repository is released under the Apache 2.0 license as found in the LICENSE file.

Applying PVT to Semantic Segmentation

Related tags

Overview

Applying PVT to Semantic Segmentation

Usage

Data preparation

Results and models

Evaluation

Training

License

Owner

Dados coletados e programas desenvolvidos no processo de iniciação científica

Benchmark for evaluating open-ended generation

Hyperparameter tuning for humans

PyTorch code for our paper "Attention in Attention Network for Image Super-Resolution"

Code for ICCV 2021 paper "Distilling Holistic Knowledge with Graph Neural Networks"

A Python package to process & model ChEMBL data.

A series of Python scripts to access measurements from Fluke 28X meters. Fluke IR Remote Interface required.

The code for paper Efficiently Solve the Max-cut Problem via a Quantum Qubit Rotation Algorithm

A Python package for time series augmentation

TSIT: A Simple and Versatile Framework for Image-to-Image Translation

PyTorch implementation for paper Neural Marching Cubes.

YuNetのPythonでのONNX、TensorFlow-Lite推論サンプル

PyTorch implementation of "Continual Learning with Deep Generative Replay", NIPS 2017

Multi-Template Mouse Brain MRI Atlas (MBMA): both in-vivo and ex-vivo

Official Pytorch implementation for "End2End Occluded Face Recognition by Masking Corrupted Features, TPAMI 2021"

Pytorch code for our paper "Feedback Network for Image Super-Resolution" (CVPR2019)

Very large and sparse networks appear often in the wild and present unique algorithmic opportunities and challenges for the practitioner

Mapping Conditional Distributions for Domain Adaptation Under Generalized Target Shift

Source code for Task-Aware Variational Adversarial Active Learning

Code for Deterministic Neural Networks with Appropriate Inductive Biases Capture Epistemic and Aleatoric Uncertainty