Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Abstract

Recent works have made great success in semantic segmentation by exploiting contextual information in a local or global manner within individual image and supervising the model with pixel-wise cross entropy loss. However, from the holistic view of the whole dataset, semantic relations not only exist inside one single image, but also prevail in the whole training data, which makes solely considering intra-image correlations insufficient. Inspired by recent progress in unsupervised contrastive learning, we propose the region-aware contrastive learning (RegionContrast) for semantic segmentation in the supervised manner. In order to enhance the similarity of semantically similar pixels while keeping the discrimination from others, we employ contrastive learning to realize this objective. With the help of memory bank, we explore to store all the representative features into the memory. Without loss of generality, to efficiently incorporate all training data into the memory bank while avoiding taking too much computation resource, we propose to construct region centers to represent features from different categories for every image. Hence, the proposed region-aware contrastive learning is performed in a region level for all the training data, which saves much more memory than methods exploring the pixel-level relations. The proposed RegionContrast brings little computation cost during training and requires no extra overhead for testing. Extensive experiments demonstrate that our method achieves state-of-the-art performance on three benchmark datasets including Cityscapes, ADE20K and COCO Stuff. For more details, please refer to our ICCV paper (paper).

Installation

Check INSTALL.md for installation instructions.

Training and Evaluation

cd experiments/v3_contrast
bash train.sh

Citation

@InProceedings{Hu_2021_ICCV,
    author    = {Hu, Hanzhe and Cui, Jinshi and Wang, Liwei},
    title     = {Region-Aware Contrastive Learning for Semantic Segmentation},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {16291-16301}
}

TODO

Dynamic Sampling

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Related tags

Overview

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Abstract

Installation

Training and Evaluation

Citation

TODO

Owner

Hanzhe Hu

Official Pytorch implementation of "Learning Debiased Representation via Disentangled Feature Augmentation (Neurips 2021, Oral)"

Semantic Segmentation of images using PixelLib with help of Pascalvoc dataset trained with Deeplabv3+ framework.

GLODISMO: Gradient-Based Learning of Discrete Structured Measurement Operators for Signal Recovery

Sharpness-Aware Minimization for Efficiently Improving Generalization

PyTorch implementation code for the paper MixCo: Mix-up Contrastive Learning for Visual Representation

Forecasting with Gradient Boosted Time Series Decomposition

Multi-query Video Retreival

PyTorch-lightning implementation of the ESFW module proposed in our paper Edge-Selective Feature Weaving for Point Cloud Matching

Code for DeepCurrents: Learning Implicit Representations of Shapes with Boundaries

Code for "Long Range Probabilistic Forecasting in Time-Series using High Order Statistics"

Offcial implementation of "A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction, ICCV-2021".

Breaking Shortcut: Exploring Fully Convolutional Cycle-Consistency for Video Correspondence Learning

PyTorch Lightning implementation of Automatic Speech Recognition

Occlusion robust 3D face reconstruction model in CFR-GAN (WACV 2022)

Localized representation learning from Vision and Text (LoVT)

Train Dense Passage Retriever (DPR) with a single GPU

FLVIS: Feedback Loop Based Visual Initial SLAM

Source code for CVPR2022 paper "Abandoning the Bayer-Filter to See in the Dark"

Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"

Securetar - A streaming wrapper around python tarfile and allow secure handling files and support encryption