Complementary Patch for Weakly Supervised Semantic Segmentation, ICCV21 (poster)

Last update: Dec 12, 2022

CPN (ICCV2021)

This is an implementation of Complementary Patch for Weakly Supervised Semantic Segmentation, accepted as a poster at ICCV 2021.

This implementation is based on SEAM and IRN.

Abstract

Weakly Supervised Semantic Segmentation (WSSS) based on image-level labels has been greatly advanced by exploiting the outputs of Class Activation Map (CAM) to generate the pseudo labels for semantic segmentation. However, CAM merely discovers seeds from a small number of regions, which may be insufficient to serve as pseudo masks for semantic segmentation. In this paper, we formulate the expansion of object regions in CAM as an increase in information. From the perspective of information theory, we propose a novel Complementary Patch (CP) Representation and prove that the information of the sum of the CAMs by a pair of input images with complementary hidden (patched) parts, namely CP Pair, is greater than or equal to the information of the baseline CAM. Therefore, a CAM with more information related to object seeds can be obtained by narrowing down the gap between the sum of CAMs generated by the CP Pair and the original CAM. We propose a CP Network (CPN) implemented by a triplet network and three regularization functions. To further improve the quality of the CAMs, we propose a Pixel-Region Correlation Module (PRCM) to augment the contextual information by using object-region relations between the feature maps and the CAMs. Experimental results on the PASCAL VOC 2012 datasets show that our proposed method achieves a new state-of-the-art in WSSS, validating the effectiveness of our CP Representation and CPN.
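To make the CP Pair idea from the abstract concrete, here is a minimal NumPy sketch, not the code in this repository. The names `make_cp_pair` and `cp_gap`, the patch size, the hiding probability, and the L1 form of the gap are all assumptions made for illustration; the actual network, patch sampling, and regularization functions are the ones defined in the paper and in train_cpn.py.

```python
import numpy as np


def make_cp_pair(image, patch_size=32, hide_prob=0.5, rng=None):
    """Split an (H, W, C) image into two views with complementary hidden patches.

    Hypothetical helper: every patch is visible in exactly one of the two views.
    """
    rng = np.random.default_rng() if rng is None else rng
    h, w = image.shape[:2]
    # One random keep/hide decision per patch on a coarse grid (ceil division).
    grid_h = -(-h // patch_size)
    grid_w = -(-w // patch_size)
    grid = rng.random((grid_h, grid_w)) < hide_prob
    # Upsample the grid to pixel resolution and crop to the image size.
    mask = np.repeat(np.repeat(grid, patch_size, axis=0), patch_size, axis=1)
    mask = mask[:h, :w]
    # view_a keeps the patches where mask is True, view_b keeps the rest.
    view_a = image * mask[..., None]
    view_b = image * ~mask[..., None]
    return view_a, view_b, mask


def cp_gap(cam_full, cam_a, cam_b):
    """Mean absolute gap between the original CAM and the summed CP-pair CAMs.

    The L1 form is an assumption; see the paper for the actual losses.
    """
    return float(np.abs(cam_full - (cam_a + cam_b)).mean())
```

Because the two masks are complementary, `view_a + view_b` reproduces the input image exactly, which is the property the CP Representation relies on.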

Prerequisite

The requirements are listed in requirements.txt, but other settings also work (we used CUDA 11.0 and PyTorch 1.7 on one RTX 3090). If you have sufficient GPU resources, the batch size can be increased to 8 or 16, which may give higher performance than reported in the paper.
The pretrained weights for initializing ResNet38 and the well-trained CPN are available here on BaiDuYun (code: y6h4) or here on Google Drive.
The PASCAL VOC 2012 devkit, in its expanded version with 10582 training samples.

Usage

Train the CPN to obtain the weights, which will be saved in "CPN/CPN". Remember to set the paths to VOC12 and the pre-trained weights in the script.
```
python train_cpn.py
```
Generate the foreground seeds of the CAM (without background) using your trained weights or the well-trained CPN. The results are saved in out_cam.
```
python infer_cam.py 
```
Evaluate the CAM by selecting the background. Remember to set the data path of VOC in this script.
```
python evaluation_cam.py
```
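As a rough illustration of what "selecting the background" means when scoring foreground-only CAMs, the sketch below assigns a constant background score to every pixel, takes the per-pixel argmax, and sweeps that constant to find the value with the best mIoU. This is an assumption about the general procedure, not a copy of evaluation_cam.py: the function names, the array layout (K foreground maps per image plus their class ids), and averaging IoU only over classes that occur are all hypothetical choices.

```python
import numpy as np


def cam_to_label(cam, keys, bg_threshold):
    """cam: (K, H, W) foreground scores; keys: K class ids (> 0).

    Returns an (H, W) label map where 0 denotes background.
    """
    # A constant score plays the role of the background "class".
    bg = np.full((1,) + cam.shape[1:], bg_threshold, dtype=cam.dtype)
    scores = np.concatenate([bg, cam], axis=0)
    class_ids = np.concatenate([[0], np.asarray(keys)])
    return class_ids[scores.argmax(axis=0)]


def mean_iou(preds, gts, num_classes=21, ignore_index=255):
    """mIoU over the classes that occur in the predictions or ground truth."""
    conf = np.zeros((num_classes, num_classes), dtype=np.int64)
    for pred, gt in zip(preds, gts):
        valid = gt != ignore_index
        idx = gt[valid].astype(np.int64) * num_classes + pred[valid]
        conf += np.bincount(idx, minlength=num_classes ** 2).reshape(
            num_classes, num_classes)
    inter = np.diag(conf)
    union = conf.sum(axis=0) + conf.sum(axis=1) - inter
    present = union > 0
    return float((inter[present] / union[present]).mean())


def best_background_threshold(cams, keys_list, gts, thresholds):
    """Try each background score and return the best one with all results."""
    results = {}
    for t in thresholds:
        preds = [cam_to_label(c, k, t) for c, k in zip(cams, keys_list)]
        results[t] = mean_iou(preds, gts)
    return max(results, key=results.get), results
```

A low background score labels almost everything as foreground and a high one removes most of the seeds, so the reported CAM quality depends on this sweep.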

Implementation of results in paper

I suggest using IRN for the second expansion of the CAM. Although you can directly use the older AffinityNet, it may take a long time to find the parameters that generate CAMs reaching the reported performance. You can directly use the well-trained weights from IRN to generate the masks for segmentation.
For the segmentation model, we use the DeepLab here.

Acknowledgement

Many thanks to the authors of SEAM and IRN for their code.

Owner

Ferenas
