Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021)

Last update: Dec 16, 2022

Overview

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021)

The implementation of Reducing Infromation Bottleneck for Weakly Supervised Semantic Segmentation, Jungbeom Lee, Jooyoung Choi, Jisoo Mok, and Sungroh Yoon, NeurIPS 2021. [[paper]]

Abstract

Weakly supervised semantic segmentation produces pixel-level localization from class labels; however, a classifier trained on such labels is likely to focus on a small discriminative region of the target object. We interpret this phenomenon using the information bottleneck principle: the final layer of a deep neural network, activated by the sigmoid or softmax activation functions, causes an information bottleneck, and as a result, only a subset of the task-relevant information is passed on to the output. We first support this argument through a simulated toy experiment and then propose a method to reduce the information bottleneck by removing the last activation function. In addition, we introduce a new pooling method that further encourages the transmission of information from non-discriminative regions to the classification. Our experimental evaluations demonstrate that this simple modification significantly improves the quality of localization maps on both the PASCAL VOC 2012 and MS COCO 2014 datasets, exhibiting a new state-of-the-art performance for weakly supervised semantic segmentation.

Installation

We kindly refer to the offical implementation of IRN.

Usage

Step 1. Prepare Dataset

Download Pascal VOC dataset here.
Download MS COCO images from the official COCO website here.
Download semantic segmentation annotations for the MS COCO dataset here.
Directory hierarchy

    Dataset
    ├── VOC2012_SEG_AUG       # unzip VOC2012_SEG_AUG.zip           
    ├── coco_2017             # mkdir coco_2017
    │   ├── coco_seg_anno     # included in coco_annotations_semantic.zip
    └── └── JPEGImages        # include train and val images downloaded from the official COCO website

Step 2. Prepare pre-trained classifier

Pre-trained model used in this paper: Pascal VOC, MS COCO.
You can also train your own classifiers following IRN.

Step 3. Generate and evaluate the pseudo ground-truth masks for PASCAL VOC and MS COCO

PASCAL VOC

bash get_pseudo_gt_VOC.sh

MS COCO

bash get_pseudo_gt_COCO.sh

Step 4. Train a semantic segmentation network

To train DeepLab-v2, we refer to deeplab-pytorch. However, this repo contains only COCO pre-trained model. We provide ImageNet pre-trained model for a fair comparison with the other methods.

Acknowledgment

This code is heavily borrowed from IRN, thanks jiwoon-ahn!

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021)

Related tags

Overview

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021)

Abstract

Installation

Usage

Step 1. Prepare Dataset

Step 2. Prepare pre-trained classifier

Step 3. Generate and evaluate the pseudo ground-truth masks for PASCAL VOC and MS COCO

Step 4. Train a semantic segmentation network

Acknowledgment

Owner

Jungbeom Lee

A script written in Python that returns a consensus string and profile matrix of a given DNA string(s) in FASTA format.

(IEEE TIP 2021) Regularized Densely-connected Pyramid Network for Salient Instance Segmentation

Sign-to-Speech for Sign Language Understanding: A case study of Nigerian Sign Language

Official public repository of paper "Intention Adaptive Graph Neural Network for Category-Aware Session-Based Recommendation"

This is the code repository for the paper A hierarchical semantic segmentation framework for computer-vision-based bridge column damage detection

Learning Temporal Consistency for Low Light Video Enhancement from Single Images (CVPR2021)

Simultaneous NMT/MMT framework in PyTorch

"Exploring Vision Transformers for Fine-grained Classification" at CVPRW FGVC8

Deep learning with TensorFlow and earth observation data.

RSNA Intracranial Hemorrhage Detection with python

P-Tuning v2: Prompt Tuning Can Be Comparable to Finetuning Universally Across Scales and Tasks

Joint Unsupervised Learning (JULE) of Deep Representations and Image Clusters.

UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model

Official release of MSHT: Multi-stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer axriv: http://arxiv.org/abs/2112.13513

harmonic-percussive-residual separation algorithm wrapped as a VST3 plugin (iPlug2)

Extremely simple and fast extreme multi-class and multi-label classifiers.

Unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"

Implementation of SiameseXML (ICML 2021)

Cowsay - A rewrite of cowsay in python

A Pytorch implementation of CVPR 2021 paper "RSG: A Simple but Effective Module for Learning Imbalanced Datasets"