TensorFlow-based implementation of "Pyramid Scene Parsing Network".

Last update: Dec 20, 2022

Overview

PSPNet_tensorflow

Important

Code is fine for inference. However, the training code is just for reference and might be only used for fine-tuning. If you want to train from scratch, you need to implement the Synchronize BN layer first to do large batch-size training (as described in the paper). It seems that this repo has reproduced it, you can take a look on it.

Introduction

This is an implementation of PSPNet in TensorFlow for semantic segmentation on the cityscapes dataset. We first convert weight from Original Code by using caffe-tensorflow framework.

Update:

News (2018.11.08 updated):

Now you can try PSPNet on your own image online using ModelDepot live demo!

2018/01/24:

Support evaluation code for ade20k dataset

2018/01/19:

Support inference phase for ade20k dataset using model of pspnet50 (convert weights from original author)
Using tf.matmul to decode label, so as to improve the speed of inference.

2017/11/06:

Support different input size by padding input image to (720, 720) if original size is smaller than it, and get result by cropping image in the end.

2017/10/27:

Change bn layer from tf.nn.batch_normalization into tf.layers.batch_normalization in order to support training phase. Also update initial model in Google Drive.

Install

Get restore checkpoint from Google Drive and put into model directory. Note: Select the checkpoint corresponding to the dataset.

Inference

To get result on your own images, use the following command:

python inference.py --img-path=./input/test.png --dataset cityscapes

Inference time: ~0.6s

Options:

--dataset cityscapes or ade20k
--flipped-eval 
--checkpoints /PATH/TO/CHECKPOINT_DIR

Evaluation

Cityscapes

Perform in single-scaled model on the cityscapes validation datase.

Method	Accuracy
Without flip	76.99%
Flip	77.23%

ade20k

Method	Accuracy
Without flip	40.00%
Flip	40.67%

To re-produce evluation results, do following steps:

Download Cityscape dataset or ADE20k dataset first.
change data_dir to your dataset path in evaluate.py:

'data_dir': ' = /Path/to/dataset'

Run the following command:

python evaluate.py --dataset cityscapes

List of Args:

--dataset - ade20k or cityscapes
--flipped-eval  - Using flipped evaluation method
--measure-time  - Calculate inference time

Image Result

cityscapes

Input image	Output image

ade20k

Input image	Output image

real world

Input image	Output image

Citation

@article{zhao2017pspnet,
  author = {Hengshuang Zhao and
            Jianping Shi and
            Xiaojuan Qi and
            Xiaogang Wang and
            Jiaya Jia},
  title = {Pyramid Scene Parsing Network},
  booktitle = {Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2017}
}

Scene Parsing through ADE20K Dataset. B. Zhou, H. Zhao, X. Puig, S. Fidler, A. Barriuso and A. Torralba. Computer Vision and Pattern Recognition (CVPR), 2017. (http://people.csail.mit.edu/bzhou/publication/scene-parse-camera-ready.pdf)

@inproceedings{zhou2017scene,
    title={Scene Parsing through ADE20K Dataset},
    author={Zhou, Bolei and Zhao, Hang and Puig, Xavier and Fidler, Sanja and Barriuso, Adela and Torralba, Antonio},
    booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
    year={2017}
}

Semantic Understanding of Scenes through ADE20K Dataset. B. Zhou, H. Zhao, X. Puig, S. Fidler, A. Barriuso and A. Torralba. arXiv:1608.05442. (https://arxiv.org/pdf/1608.05442.pdf)

@article{zhou2016semantic,
  title={Semantic understanding of scenes through the ade20k dataset},
  author={Zhou, Bolei and Zhao, Hang and Puig, Xavier and Fidler, Sanja and Barriuso, Adela and Torralba, Antonio},
  journal={arXiv preprint arXiv:1608.05442},
  year={2016}
}

TensorFlow-based implementation of "Pyramid Scene Parsing Network".

Related tags

Overview

PSPNet_tensorflow

Important

Introduction

Update:

News (2018.11.08 updated):

2018/01/24:

2018/01/19:

2017/11/06:

2017/10/27:

Install

Inference

Evaluation

Cityscapes

ade20k

Image Result

cityscapes

ade20k

real world

Citation

Owner

HsuanKung Yang

Large Scale Multi-Illuminant (LSMI) Dataset for Developing White Balance Algorithm under Mixed Illumination

Instant-nerf-pytorch - NeRF trained SUPER FAST in pytorch

Code corresponding to The Introspective Agent: Interdependence of Strategy, Physiology, and Sensing for Embodied Agents

ServiceX Transformer that converts flat ROOT ntuples into columnwise data

Codes for CIKM'21 paper 'Self-Supervised Graph Co-Training for Session-based Recommendation'.

You Only 👀 One Sequence

Source code for "Roto-translated Local Coordinate Framesfor Interacting Dynamical Systems"

Parameter-ensemble-differential-evolution - Shows how to do parameter ensembling using differential evolution.

商品推荐系统

A Robust Unsupervised Ensemble of Feature-Based Explanations using Restricted Boltzmann Machines

BLEND: A Fast, Memory-Efficient, and Accurate Mechanism to Find Fuzzy Seed Matches

Code and data (Incidents Dataset) for ECCV 2020 Paper "Detecting natural disasters, damage, and incidents in the wild".

PyTorch deep learning projects made easy.

Collaborative forensic timeline analysis

Sematic-Segmantation - Semantic Segmentation on MIT ADE20K dataset in PyTorch

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

[ICCV 2021] Deep Hough Voting for Robust Global Registration

Distinguishing Commercial from Editorial Content in News

Implementation of our NeurIPS 2021 paper "A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs".

Official implementation of "Open-set Label Noise Can Improve Robustness Against Inherent Label Noise" (NeurIPS 2021)