TensorFlow implementation of original paper : https://github.com/hszhao/PSPNet

Last update: Dec 29, 2022

Overview

Keras implementation of PSPNet(caffe)

Implemented Architecture of Pyramid Scene Parsing Network in Keras.

For the best compability please use Python3.5

Setup

Install dependencies:
- Tensorflow (-gpu)
- Keras
- numpy
- scipy
- pycaffe(PSPNet)(optional for converting the weights)
```
pip install -r requirements.txt --upgrade
```
Converted trained weights are needed to run the network. Weights(in .h5 .json format) have to be downloaded and placed into directory weights/keras

Already converted weights can be downloaded here:

Convert weights by yourself(optional)

(Note: this is not required if you use .h5/.json weights)

Running this needs the compiled original PSPNet caffe code and pycaffe.

python weight_converter.py <path to .prototxt> <path to .caffemodel>

Usage:

python pspnet.py -m <model> -i <input_image>  -o <output_path>
python pspnet.py -m pspnet101_cityscapes -i example_images/cityscapes.png -o example_results/cityscapes.jpg
python pspnet.py -m pspnet101_voc2012 -i example_images/pascal_voc.jpg -o example_results/pascal_voc.jpg

List of arguments:

 -m --model        - which model to use: 'pspnet50_ade20k', 'pspnet101_cityscapes', 'pspnet101_voc2012'
    --id           - (int) GPU Device id. Default 0
 -s --sliding      - Use sliding window
 -f --flip         - Additional prediction of flipped image
 -ms --multi_scale - Predict on multiscale images

Keras results:

Implementation details

The interpolation layer is implemented as custom layer "Interp"
Forward step takes about ~1 sec on single image

Memory usage can be optimized with:

config = tf.ConfigProto()
config.gpu_options.per_process_gpu_memory_fraction = 0.3 
sess = tf.Session(config=config)

ndimage.zoom can take a long time

TensorFlow implementation of original paper : https://github.com/hszhao/PSPNet

Related tags

Overview

Keras implementation of PSPNet(caffe)

Setup

Convert weights by yourself(optional)

Usage:

Keras results:

Implementation details

Owner

VladKry

This is the pytorch code for the paper Curious Representation Learning for Embodied Intelligence.

SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data

A library that allows for inference on probabilistic models

Hippocampal segmentation using the UNet network for each axis

Official implementation of VaxNeRF (Voxel-Accelearated NeRF).

code for our ECCV 2020 paper "A Balanced and Uncertainty-aware Approach for Partial Domain Adaptation"

Job Assignment System by Real-time Emotion Detection

Light-Head R-CNN

Implementation of Deep Deterministic Policy Gradiet Algorithm in Tensorflow

Dynamic View Synthesis from Dynamic Monocular Video

PyTorch implementation of "LayoutTransformer: Layout Generation and Completion with Self-attention"

The Environment I built to study Reinforcement Learning + Pokemon Showdown

Code for a real-time distributed cooperative slam(RDC-SLAM) system for ROS compatible platforms.

Predictive Modeling on Electronic Health Records(EHR) using Pytorch

End-to-End Referring Video Object Segmentation with Multimodal Transformers

Modified prey-predator system - Modified prey–predator model describes the rate of change for each species by adding coupling terms.

Trading Gym is an open source project for the development of reinforcement learning algorithms in the context of trading.

Wide Residual Networks (WideResNets) in PyTorch

A trusty face recognition research platform developed by Tencent Youtu Lab

Pretrained SOTA Deep Learning models, callbacks and more for research and production with PyTorch Lightning and PyTorch