PyTorch implementation of PSPNet

Last update: Nov 16, 2022

Overview

PSPNet with PyTorch

Unofficial implementation of "Pyramid Scene Parsing Network" (https://arxiv.org/abs/1612.01105). This repository is just for caffe to pytorch model conversion and evaluation.

Requirements

pytorch
click
addict
pydensecrf
protobuf

Preparation

Instead of building the author's caffe implementation, you can convert off-the-shelf caffemodels to pytorch models via the caffe.proto.

1. Compile the `caffe.proto` for Python API

This step can be skipped. FYI.
Download the author's caffe.proto into the libs, not the one in the original caffe.

# For protoc command
pip install protobuf
# This generates ./caffe_pb2.py
protoc --python_out=. caffe.proto

2. Model conversion

Find the caffemodels on the author's page (e.g. pspnet50_ADE20K.caffemodel) and store them to the data/models/ directory.
Convert the caffemodels to .pth file.

python convert.py -c <PATH TO YAML>

Demo

python demo.py -c <PATH TO YAML> -i <PATH TO IMAGE>

With a --no-cuda option, this runs on CPU.
With a --crf option, you can perform a CRF postprocessing.

Evaluation

PASCAL VOC2012 only. Please set the dataset path in config/voc12.yaml.

python eval.py -c config/voc12.yaml

88.1% mIoU (SS) and 88.6% mIoU (MS) on validation set.
NOTE: 3 points lower than caffe implementation. WIP

SS: averaged prediction with flipping (2x)
MS: averaged prediction with multi-scaling (6x) and flipping (2x)
Both: No CRF post-processing

References

Official implementation: https://github.com/hszhao/PSPNet
Chainer implementation: https://github.com/mitmul/chainer-pspnet

PyTorch implementation of PSPNet

Related tags

Overview

PSPNet with PyTorch

Requirements

Preparation

1. Compile the `caffe.proto` for Python API

2. Model conversion

Demo

Evaluation

References

Owner

Kazuto Nakashima

SGPT: Multi-billion parameter models for semantic search

SEAN: Image Synthesis with Semantic Region-Adaptive Normalization (CVPR 2020, Oral)

Hierarchical Clustering: O(1)-Approximation for Well-Clustered Graphs

A Python module for parallel optimization of expensive black-box functions

This repository contains the code used in the paper "Prompt-Based Multi-Modal Image Segmentation".

ConE: Cone Embeddings for Multi-Hop Reasoning over Knowledge Graphs

Julia package for contraction of tensor networks, based on the sweep line algorithm outlined in the paper General tensor network decoding of 2D Pauli codes

Get 2D point positions (e.g., facial landmarks) projected on 3D mesh

EssentialMC2 Video Understanding

No-reference Image Quality Assessment(NIQA) Algorithms (BRISQUE, NIQE, PIQE, RankIQA, MetaIQA)

AI Based Smart Exam Proctoring Package

Implementation of the paper "Generating Symbolic Reasoning Problems with Transformer GANs"

Repository of the paper Compressing Sensor Data for Remote Assistance of Autonomous Vehicles using Deep Generative Models at ML4AD @ NeurIPS 2021.

Deep learning with dynamic computation graphs in TensorFlow

Dynamic View Synthesis from Dynamic Monocular Video

DeOldify - A Deep Learning based project for colorizing and restoring old images (and video!)

AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation

Official implementation for Likelihood Regret: An Out-of-Distribution Detection Score For Variational Auto-encoder at NeurIPS 2020

Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch

[CVPR22] Official codebase of Semantic Segmentation by Early Region Proxy.

PyTorch implementation of PSPNet

Related tags

Overview

PSPNet with PyTorch

Requirements

Preparation

1. Compile the caffe.proto for Python API

2. Model conversion

Demo

Evaluation

References

Owner

Kazuto Nakashima

SGPT: Multi-billion parameter models for semantic search

SEAN: Image Synthesis with Semantic Region-Adaptive Normalization (CVPR 2020, Oral)

Hierarchical Clustering: O(1)-Approximation for Well-Clustered Graphs

A Python module for parallel optimization of expensive black-box functions

This repository contains the code used in the paper "Prompt-Based Multi-Modal Image Segmentation".

ConE: Cone Embeddings for Multi-Hop Reasoning over Knowledge Graphs

Julia package for contraction of tensor networks, based on the sweep line algorithm outlined in the paper General tensor network decoding of 2D Pauli codes

Get 2D point positions (e.g., facial landmarks) projected on 3D mesh

EssentialMC2 Video Understanding

No-reference Image Quality Assessment(NIQA) Algorithms (BRISQUE, NIQE, PIQE, RankIQA, MetaIQA)

AI Based Smart Exam Proctoring Package

Implementation of the paper "Generating Symbolic Reasoning Problems with Transformer GANs"

Repository of the paper Compressing Sensor Data for Remote Assistance of Autonomous Vehicles using Deep Generative Models at ML4AD @ NeurIPS 2021.

Deep learning with dynamic computation graphs in TensorFlow

Dynamic View Synthesis from Dynamic Monocular Video

DeOldify - A Deep Learning based project for colorizing and restoring old images (and video!)

AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation

Official implementation for Likelihood Regret: An Out-of-Distribution Detection Score For Variational Auto-encoder at NeurIPS 2020

Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch

[CVPR22] Official codebase of Semantic Segmentation by Early Region Proxy.

1. Compile the `caffe.proto` for Python API