TensorFlow implementation of the original paper: https://github.com/hszhao/PSPNet

Last update: Dec 29, 2022

Overview

Keras implementation of PSPNet (Caffe)

The architecture of the Pyramid Scene Parsing Network, implemented in Keras.

For best compatibility, please use Python 3.5.

Setup

Install dependencies:
- TensorFlow (-gpu)
- Keras
- numpy
- scipy
- pycaffe (PSPNet; optional, only needed for converting the weights)
```
pip install -r requirements.txt --upgrade
```
Converted trained weights are needed to run the network. The weights (in .h5 and .json format) have to be downloaded and placed in the directory weights/keras.
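
Before running the network, it can help to confirm that both weight files are in place. The following is a minimal sketch, assuming the files are named after the model (for example `pspnet101_voc2012.h5` and `pspnet101_voc2012.json`). That naming scheme and the helper `missing_weight_files` are assumptions for illustration, not part of the repository.

```python
from pathlib import Path


def missing_weight_files(model_name, weights_dir="weights/keras"):
    """Return the expected weight files that are not present on disk.

    Assumption: the files are named <model_name>.h5 and <model_name>.json.
    """
    base = Path(weights_dir)
    expected = [base / (model_name + ext) for ext in (".h5", ".json")]
    return [path for path in expected if not path.is_file()]


if __name__ == "__main__":
    missing = missing_weight_files("pspnet101_voc2012")
    if missing:
        print("Missing weight files:", ", ".join(str(p) for p in missing))
    else:
        print("All weight files found.")
```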

Already converted weights can be downloaded here:

Convert weights by yourself (optional)

(Note: this is not required if you use the .h5/.json weights.)

Running the converter requires the compiled original PSPNet Caffe code and pycaffe.

python weight_converter.py <path to .prototxt> <path to .caffemodel>

Usage:

python pspnet.py -m <model> -i <input_image> -o <output_path>
python pspnet.py -m pspnet101_cityscapes -i example_images/cityscapes.png -o example_results/cityscapes.jpg
python pspnet.py -m pspnet101_voc2012 -i example_images/pascal_voc.jpg -o example_results/pascal_voc.jpg

List of arguments:

 -m --model        - which model to use: 'pspnet50_ade20k', 'pspnet101_cityscapes', 'pspnet101_voc2012'
    --id           - (int) GPU Device id. Default 0
 -s --sliding      - Use sliding window
 -f --flip         - Additional prediction of flipped image
 -ms --multi_scale - Predict on multiscale images
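
The argument list above could be declared with `argparse` roughly as follows. This is a sketch, not the script's actual parser: the destination names for `-i` and `-o`, the default model, and the help strings are assumptions, since only the flags themselves appear in this README.

```python
import argparse

# Model names as listed for the -m/--model argument
MODELS = ["pspnet50_ade20k", "pspnet101_cityscapes", "pspnet101_voc2012"]


def build_parser():
    """Build a parser that mirrors the documented command-line flags."""
    parser = argparse.ArgumentParser(description="PSPNet inference (illustrative sketch)")
    # The default model is an assumption
    parser.add_argument("-m", "--model", choices=MODELS, default="pspnet101_voc2012",
                        help="which model to use")
    # Only the short forms -i and -o appear in the usage line; dest names are assumed
    parser.add_argument("-i", dest="input_path", required=True, help="input image")
    parser.add_argument("-o", dest="output_path", required=True, help="output path")
    parser.add_argument("--id", type=int, default=0, help="GPU device id")
    parser.add_argument("-s", "--sliding", action="store_true", help="use sliding window")
    parser.add_argument("-f", "--flip", action="store_true",
                        help="additional prediction of flipped image")
    parser.add_argument("-ms", "--multi_scale", action="store_true",
                        help="predict on multiscale images")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args)
```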

Keras results:

Implementation details

The interpolation layer is implemented as a custom layer, "Interp".
A forward step takes about 1 sec on a single image.

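GPU memory usage can be limited with:

import tensorflow as tf

# Limit this process to 30% of the GPU memory (TensorFlow 1.x session API)
config = tf.ConfigProto()
config.gpu_options.per_process_gpu_memory_fraction = 0.3
sess = tf.Session(config=config)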

ndimage.zoom can take a long time
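
If resizing dominates the runtime, one option to try is lowering the spline order passed to `scipy.ndimage.zoom`, which defaults to cubic (`order=3`). The sketch below assumes a score map of shape (height, width, classes) and uses bilinear interpolation (`order=1`). The helper name `resize_scores`, the array layout, and the edge mode are illustrative choices; this README does not say how the repository calls `ndimage.zoom`.

```python
import numpy as np
from scipy import ndimage


def resize_scores(scores, out_h, out_w, order=1):
    """Resize an (H, W, C) score map to (out_h, out_w, C) with ndimage.zoom.

    order=1 is bilinear; the class axis is left untouched (zoom factor 1).
    """
    h, w = scores.shape[:2]
    factors = (out_h / float(h), out_w / float(w), 1.0)
    # mode="nearest" repeats edge values instead of padding with zeros
    return ndimage.zoom(scores, factors, order=order, mode="nearest")


if __name__ == "__main__":
    scores = np.random.rand(60, 60, 21).astype(np.float32)
    resized = resize_scores(scores, 473, 473)
    print(resized.shape)
```

Lower orders generally do less work per output pixel, so timing `order=1` against the default on a representative image is a reasonable first check.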

Owner

VladKry