PyTorch implementation of SimSiam: Exploring Simple Siamese Representation Learning

Last update: Dec 30, 2022

Related tags

Overview

SimSiam: Exploring Simple Siamese Representation Learning

This is a PyTorch implementation of the SimSiam paper:

@Article{chen2020simsiam,
  author  = {Xinlei Chen and Kaiming He},
  title   = {Exploring Simple Siamese Representation Learning},
  journal = {arXiv preprint arXiv:2011.10566},
  year    = {2020},
}

Preparation

Install PyTorch and download the ImageNet dataset following the official PyTorch ImageNet training code. Similar to MoCo, the code release contains minimal modifications for both unsupervised pre-training and linear classification to that code.

In addition, install apex for the LARS implementation needed for linear classification.

Unsupervised Pre-Training

Only multi-gpu, DistributedDataParallel training is supported; single-gpu or DataParallel training is not supported.

To do unsupervised pre-training of a ResNet-50 model on ImageNet in an 8-gpu machine, run:

python main_simsiam.py \
  -a resnet50 \
  --dist-url 'tcp://localhost:10001' --multiprocessing-distributed --world-size 1 --rank 0 \
  --fix-pred-lr \
  [your imagenet-folder with train and val folders]

The script uses all the default hyper-parameters as described in the paper, and uses the default augmentation recipe from MoCo v2.

The above command performs pre-training with a non-decaying predictor learning rate for 100 epochs, corresponding to the last row of Table 1 in the paper.

Linear Classification

With a pre-trained model, to train a supervised linear classifier on frozen features/weights in an 8-gpu machine, run:

python main_lincls.py \
  -a resnet50 \
  --dist-url 'tcp://localhost:10001' --multiprocessing-distributed --world-size 1 --rank 0 \
  --pretrained [your checkpoint path]/checkpoint_0099.pth.tar \
  --lars \
  [your imagenet-folder with train and val folders]

The above command uses LARS optimizer and a default batch size of 4096.

Models and Logs

Our pre-trained ResNet-50 models and logs:

pre-train epochs	batch size	pre-train ckpt	pre-train log	linear cls. ckpt	linear cls. log	top-1 acc.
100	512	link	link	link	link	68.1
100	256	link	link	link	link	68.3

Settings for the above: 8 NVIDIA V100 GPUs, CUDA 10.1/CuDNN 7.6.5, PyTorch 1.7.0.

Transferring to Object Detection

Same as MoCo for object detection transfer, please see moco/detection.

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

PyTorch implementation of SimSiam: Exploring Simple Siamese Representation Learning

Related tags

Overview

SimSiam: Exploring Simple Siamese Representation Learning

Preparation

Unsupervised Pre-Training

Linear Classification

Models and Logs

Transferring to Object Detection

License

Owner

Facebook Research

Oriented Response Networks, in CVPR 2017

RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth, in ICCV 2021 (oral)

Continuous Query Decomposition for Complex Query Answering in Incomplete Knowledge Graphs

ExCon: Explanation-driven Supervised Contrastive Learning

Keras-1D-NN-Classifier

Per-Pixel Classification is Not All You Need for Semantic Segmentation

Affine / perspective transformation in Pose Estimation with Tensorflow 2

Implementation of Deep Deterministic Policy Gradiet Algorithm in Tensorflow

Airborne magnetic data of the Osborne Mine and Lightning Creek sill complex, Australia

Relative Uncertainty Learning for Facial Expression Recognition

Machine learning, in numpy

Certifiable Outlier-Robust Geometric Perception

Dense Unsupervised Learning for Video Segmentation (NeurIPS*2021)

Implementation of Bidirectional Recurrent Independent Mechanisms (Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural Networks with Attention over Modules)

Official Pytorch implementation of "Learning to Estimate Robust 3D Human Mesh from In-the-Wild Crowded Scenes", CVPR 2022

GUI for TOAD-GAN, a PCG-ML algorithm for Token-based Super Mario Bros. Levels.

Square Root Bundle Adjustment for Large-Scale Reconstruction

An end-to-end image translation model with weight-map for color constancy

Repo for my Tensorflow/Keras CV experiments. Mostly revolving around the Danbooru20xx dataset

Pytorch implementation of forward and inverse Haar Wavelets 2D