[CVPR 2022 Oral] Rethinking Minimal Sufficient Representation in Contrastive Learning

Last update: Nov 23, 2022

Related tags

Overview

Rethinking Minimal Sufficient Representation in Contrastive Learning

PyTorch implementation of
Rethinking Minimal Sufficient Representation in Contrastive Learning
Haoqing Wang, Xun Guo, Zhi-hong Deng, Yan Lu

CVPR 2022 Oral

Abstract

Contrastive learning between different views of the data achieves outstanding success in the field of self-supervised representation learning and the learned representations are useful in broad downstream tasks. Since all supervision information for one view comes from the other view, contrastive learning approximately obtains the minimal sufficient representation which contains the shared information and eliminates the non-shared information between views. Considering the diversity of the downstream tasks, it cannot be guaranteed that all task-relevant information is shared between views. Therefore, we assume the non-shared task-relevant information cannot be ignored and theoretically prove that the minimal sufficient representation in contrastive learning is not sufficient for the downstream tasks, which causes performance degradation. This reveals a new problem that the contrastive learning models have the risk of over-fitting to the shared information between views. To alleviate this problem, we propose to increase the mutual information between the representation and input as regularization to approximately introduce more task-relevant information, since we cannot utilize any downstream task information during training. Extensive experiments verify the rationality of our analysis and the effectiveness of our method. It significantly improves the performance of several classic contrastive learning models in downstream tasks.

Citation

If you use this code for your research, please cite our paper:

@inproceedings{wang2022rethinking,
  title={Rethinking Minimal Sufficient Representation in Contrastive Learning},
  author={Wang, Haoqing and Deng, Zhi-hong and Guo, Xun and Lu, Yan},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={xx--xx},
  year={2022}
}

Note

This code is built upon the implementation from moco and CLAE.
The dataset, model, and code are for non-commercial research purposes only.

[CVPR 2022 Oral] Rethinking Minimal Sufficient Representation in Contrastive Learning

Related tags

Overview

Rethinking Minimal Sufficient Representation in Contrastive Learning

Abstract

Citation

Note

Owner

Mining-the-Social-Web-3rd-Edition - The official online compendium for Mining the Social Web, 3rd Edition (O'Reilly, 2018)

[TIP2020] Adaptive Graph Representation Learning for Video Person Re-identification

This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-clustering.

Diabet Feature Engineering - Predict whether people have diabetes when their characteristics are specified

Reinfore learning tool box, contains trpo, a3c algorithm for continous action space

Implementation for NeurIPS 2021 Submission: SparseFed

Image data augmentation scheduler for albumentations transforms

Code for the paper "Reinforced Active Learning for Image Segmentation"

Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)

ROS support for Velodyne 3D LIDARs

E-RAFT: Dense Optical Flow from Event Cameras

Official implementation of the article "Unsupervised JPEG Domain Adaptation For Practical Digital Forensics"

Repository for MDPGT

[CVPR2021 Oral] FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation.

This is a tensorflow-based rotation detection benchmark, also called AlphaRotate.

A lightweight library to compare different PyTorch implementations of the same network architecture.

Fast mesh denoising with data driven normal filtering using deep variational autoencoders

Script utilizando OpenCV e modelo Machine Learning para detectar o uso de máscaras.

SpinalNet: Deep Neural Network with Gradual Input

Official PyTorch implementation of the Fishr regularization for out-of-distribution generalization