Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Last update: Dec 29, 2022

Overview

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Abstract

Recent works have made great success in semantic segmentation by exploiting contextual information in a local or global manner within individual image and supervising the model with pixel-wise cross entropy loss. However, from the holistic view of the whole dataset, semantic relations not only exist inside one single image, but also prevail in the whole training data, which makes solely considering intra-image correlations insufficient. Inspired by recent progress in unsupervised contrastive learning, we propose the region-aware contrastive learning (RegionContrast) for semantic segmentation in the supervised manner. In order to enhance the similarity of semantically similar pixels while keeping the discrimination from others, we employ contrastive learning to realize this objective. With the help of memory bank, we explore to store all the representative features into the memory. Without loss of generality, to efficiently incorporate all training data into the memory bank while avoiding taking too much computation resource, we propose to construct region centers to represent features from different categories for every image. Hence, the proposed region-aware contrastive learning is performed in a region level for all the training data, which saves much more memory than methods exploring the pixel-level relations. The proposed RegionContrast brings little computation cost during training and requires no extra overhead for testing. Extensive experiments demonstrate that our method achieves state-of-the-art performance on three benchmark datasets including Cityscapes, ADE20K and COCO Stuff. For more details, please refer to our ICCV paper (paper).

Installation

Check INSTALL.md for installation instructions.

Training and Evaluation

cd experiments/v3_contrast
bash train.sh

Citation

@InProceedings{Hu_2021_ICCV,
    author    = {Hu, Hanzhe and Cui, Jinshi and Wang, Liwei},
    title     = {Region-Aware Contrastive Learning for Semantic Segmentation},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {16291-16301}
}

TODO

Dynamic Sampling

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Related tags

Overview

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Abstract

Installation

Training and Evaluation

Citation

TODO

Owner

Hanzhe Hu

A simple pygame dino game which can also be trained and played by a NEAT KI

Official code for article "Expression is enough: Improving traﬀic signal control with advanced traﬀic state representation"

A quantum game modeling of pandemic (QHack 2022)

A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body

Low Complexity Channel estimation with Neural Network Solutions

Official implementation for paper: A Latent Transformer for Disentangled Face Editing in Images and Videos.

Implementation of Deep Deterministic Policy Gradiet Algorithm in Tensorflow

CN24 is a complete semantic segmentation framework using fully convolutional networks

Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.

SCALE: Modeling Clothed Humans with a Surface Codec of Articulated Local Elements (CVPR 2021)

Official implementation of particle-based models (GNS and DPI-Net) on the Physion dataset.

NeRViS: Neural Re-rendering for Full-frame Video Stabilization

Toolbox to analyze temporal context invariance of deep neural networks

LaneDetectionAndLaneKeeping - Lane Detection And Lane Keeping

Official PyTorch implementation of CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds

HarDNeXt: Official HarDNeXt repository

Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

MutualGuide is a compact object detector specially designed for embedded devices

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.

Some methods for comparing network representations in deep learning and neuroscience.

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Related tags

Overview

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Abstract

Installation

Training and Evaluation

Citation

TODO

Owner

Hanzhe Hu

A simple pygame dino game which can also be trained and played by a NEAT KI

Official code for article "Expression is enough: Improving traﬀic signal control with advanced traﬀic state representation"

A quantum game modeling of pandemic (QHack 2022)

A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body

Low Complexity Channel estimation with Neural Network Solutions

Official implementation for paper: A Latent Transformer for Disentangled Face Editing in Images and Videos.

Implementation of Deep Deterministic Policy Gradiet Algorithm in Tensorflow

CN24 is a complete semantic segmentation framework using fully convolutional networks

Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.

SCALE: Modeling Clothed Humans with a Surface Codec of Articulated Local Elements (CVPR 2021)

Official implementation of particle-based models (GNS and DPI-Net) on the Physion dataset.

NeRViS: Neural Re-rendering for Full-frame Video Stabilization

Toolbox to analyze temporal context invariance of deep neural networks

LaneDetectionAndLaneKeeping - Lane Detection And Lane Keeping

Official PyTorch implementation of CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds

HarDNeXt: Official HarDNeXt repository

Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

MutualGuide is a compact object detector specially designed for embedded devices

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long*, Evan Shelhamer*, and Trevor Darrell. CVPR 2015 and PAMI 2016.

Some methods for comparing network representations in deep learning and neuroscience.

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.