PyTorch implementation of DirectCLR from the paper "Understanding Dimensional Collapse in Contrastive Self-supervised Learning"

Last update: Dec 21, 2022

Overview

DirectCLR

DirectCLR is a simple contrastive learning model for visual representation learning. Unlike SimCLR, it does not require a trainable projector. It prevents dimensional collapse and outperforms SimCLR equipped with a trainable linear projector.
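As a rough illustration of the idea, the sketch below applies a contrastive loss to a fixed sub-vector of the backbone representation instead of passing it through a trainable projector. It is not the repository's code. It assumes, following the paper, that the sub-vector is the first `dim` channels and that `dim` plays the role of the `--dim` option; the function names, the temperature value, and the simplified one-directional InfoNCE form are all choices made for this example.

```python
import torch
import torch.nn.functional as F


def info_nce(z1: torch.Tensor, z2: torch.Tensor, temperature: float = 0.1) -> torch.Tensor:
    """Simplified InfoNCE: row i of z1 and row i of z2 are two views of one image."""
    z1 = F.normalize(z1, dim=1)
    z2 = F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / temperature                    # (N, N) scaled cosine similarities
    targets = torch.arange(z1.size(0), device=z1.device)  # positives sit on the diagonal
    return F.cross_entropy(logits, targets)


def directclr_loss(r1: torch.Tensor, r2: torch.Tensor,
                   dim: int = 360, temperature: float = 0.1) -> torch.Tensor:
    """Feed only the first `dim` channels of each representation to the loss (assumed slicing)."""
    return info_nce(r1[:, :dim], r2[:, :dim], temperature)


# Usage with random stand-ins for ResNet-50 representations (2048 channels).
torch.manual_seed(0)
r1 = torch.randn(8, 2048)
r2 = r1 + 0.01 * torch.randn(8, 2048)  # a slightly perturbed second "view"
loss = directclr_loss(r1, r2)
```

In this sketch the channels beyond `dim` do not enter the loss at all, so there are no projector weights to train.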

This repository is a PyTorch implementation of DirectCLR from the paper "Understanding Dimensional Collapse in Contrastive Self-supervised Learning":

@article{Jing2021UnderstandingDC,
  title={Understanding Dimensional Collapse in Contrastive Self-supervised Learning},
  author={Li Jing and Pascal Vincent and Yann LeCun and Yuandong Tian},
  journal={arXiv preprint arXiv:2110.09348},
  year={2021}
}

DirectCLR Training

Install PyTorch and download ImageNet by following the instructions in the requirements section of the PyTorch ImageNet training example. The code has been developed for PyTorch version 1.7.1 and torchvision version 0.8.2, but it should work with other versions just as well.

Our best model is obtained by running the following command:

python main.py --data /path/to/imagenet/ --mode directclr --dim 360

The --mode option selects one of the following:

simclr: standard SimCLR with a two-layer nonlinear projector;

single: SimCLR with a single-layer linear projector;

baseline: SimCLR without a projector;

directclr: DirectCLR with a single-layer linear projector.
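The argument parser in main.py is not reproduced here. As a hypothetical sketch, a command line with these options could be declared with argparse as follows. Only the flag names `--data`, `--mode`, `--dim` and the four mode names come from the text above; the defaults, help strings, and the `build_parser` helper are assumptions made for illustration.

```python
import argparse

# The four mode names come from the list above; everything else is illustrative.
MODES = ("simclr", "single", "baseline", "directclr")


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="Hypothetical sketch of the training CLI")
    parser.add_argument("--data", required=True, help="path to the ImageNet directory")
    parser.add_argument("--mode", choices=MODES, default="directclr",
                        help="projector variant (default is an assumption)")
    parser.add_argument("--dim", type=int, default=360,
                        help="dimension passed to the chosen mode (default is an assumption)")
    return parser


# Parse the same flags as the training command shown earlier.
args = build_parser().parse_args(
    ["--data", "/path/to/imagenet/", "--mode", "directclr", "--dim", "360"]
)
print(args.mode, args.dim)
```

Restricting `--mode` with `choices` makes argparse reject a misspelled mode name before any training starts.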

Training takes approximately 7 hours on 32 V100 GPUs.

Evaluation: Linear Classification

Train a linear probe on the learned representations: freeze the ResNet weights and train a linear classifier on the entire ImageNet training set.

python linear_probe.py /path/to/imagenet/ /path/to/checkpoint/resnet50.pth

Training the linear probe takes approximately 20 hours on 8 V100 GPUs.
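The linear-probe setup described above can be sketched in a few lines. This is a minimal illustration, not the contents of linear_probe.py: a tiny random network stands in for the pretrained ResNet-50, random tensors stand in for ImageNet, and the layer sizes, learning rate, and step count are assumptions chosen so the example runs quickly on a CPU.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Tiny stand-in for the pretrained ResNet-50; a real run would load the checkpoint instead.
backbone = nn.Sequential(nn.Flatten(), nn.Linear(3 * 8 * 8, 32), nn.ReLU())
backbone.requires_grad_(False)  # freeze every backbone weight
backbone.eval()                 # keep normalization/dropout layers in inference mode

head = nn.Linear(32, 10)        # the linear probe: the only trainable module
optimizer = torch.optim.SGD(head.parameters(), lr=0.1)
criterion = nn.CrossEntropyLoss()

images = torch.randn(64, 3, 8, 8)     # random stand-in batch
labels = torch.randint(0, 10, (64,))  # random stand-in labels
frozen_before = [p.clone() for p in backbone.parameters()]

losses = []
for _ in range(20):
    with torch.no_grad():             # no gradients flow into the backbone
        features = backbone(images)
    loss = criterion(head(features), labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    losses.append(loss.item())
```

Because only `head.parameters()` is handed to the optimizer, the backbone weights stay identical to their pretrained values throughout.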

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

Owner

Meta Research
