A PyTorch re-implementation of the paper 'Exploring Simple Siamese Representation Learning'. It reaches 67.8% top-1 accuracy on ImageNet under linear evaluation, in line with the 67.7% reported in the paper.

Last update: Nov 09, 2022

Overview

Exploring simple siamese representation learning

This is a PyTorch re-implementation of the SimSiam paper on the ImageNet dataset. The results match those reported in the paper. The implementation is based on the MoCo code.

Unsupervised pre-training

To run unsupervised pre-training on ImageNet,

sh train_simsiam.sh

This runs unsupervised pre-training for 100 epochs. Please set the path to your ImageNet data folder before running.

Note 1: I tried to follow the setting in the paper, which is bs=512 and lr=0.1 on 8 GPUs, but I could not fit that batch size. So I used the largest batch size I could fit (432) and kept the lr unchanged (0.1).

Note 2: In pre-training, I didn't fix the lr of the prediction MLP. According to the paper (Table 1), fixing the lr of the prediction MLP gives a slight improvement (67.7% -> 68.1%). You can try it if interested.
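
For readers who want to try the fixed predictor lr from Note 2, the following is a minimal sketch and not code from this repository. It assumes a cosine decay schedule and a hypothetical `fix_lr` flag on each parameter group; the function name and the group layout are illustrative. The groups are plain dicts here so the snippet runs without PyTorch, but the same loop should work on `optimizer.param_groups`.

```python
import math


def adjust_learning_rate(param_groups, init_lr, epoch, total_epochs):
    """Cosine-decay the lr of every group except those flagged with fix_lr."""
    cur_lr = init_lr * 0.5 * (1.0 + math.cos(math.pi * epoch / total_epochs))
    for group in param_groups:
        if group.get("fix_lr", False):
            # Assumed flag: keep the prediction MLP at the initial lr.
            group["lr"] = init_lr
        else:
            group["lr"] = cur_lr
    return cur_lr


# Hypothetical layout: one group for the encoder, one for the prediction MLP.
param_groups = [
    {"name": "encoder", "fix_lr": False},
    {"name": "predictor", "fix_lr": True},
]
adjust_learning_rate(param_groups, init_lr=0.1, epoch=50, total_epochs=100)
for group in param_groups:
    print(group["name"], group["lr"])
```

Halfway through training, the encoder group has decayed to half the initial lr while the predictor group is still at 0.1.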

Linear evaluation

To run linear evaluation,

sh train_lincls.sh

The linear evaluation uses the NVIDIA LARC optimizer with trust_coefficient=0.001 and clip=False. The batch size is 4096.
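
To give a rough idea of what these two settings do, here is a pure-Python sketch of the layer-wise scaling as I understand it from the apex implementation; it is not the library code. With clip=False, the gradient of each layer is multiplied by a local rate proportional to the ratio of the weight norm to the gradient norm, and the base optimizer then applies its own lr on top. The function name and the list-based tensors are illustrative assumptions.

```python
import math


def larc_scaled_grad(weights, grads, trust_coefficient=0.001,
                     weight_decay=0.0, eps=1e-8):
    """Return the gradient of one layer after LARC-style scaling (clip=False)."""
    w_norm = math.sqrt(sum(w * w for w in weights))
    g_norm = math.sqrt(sum(g * g for g in grads))
    if w_norm == 0.0 or g_norm == 0.0:
        # Nothing to scale against; leave the gradient untouched.
        return list(grads)
    # Local rate: trust_coefficient * ||w|| / (||g|| + wd * ||w|| + eps).
    local_lr = trust_coefficient * w_norm / (g_norm + weight_decay * w_norm + eps)
    # Weight decay is folded into the gradient before scaling.
    return [local_lr * (g + weight_decay * w) for w, g in zip(weights, grads)]


# Toy layer: ||w|| = 5 and ||g|| = 1, so the local rate is about 0.005.
scaled = larc_scaled_grad([3.0, 4.0], [0.6, 0.8])
print(scaled)
```

With clip=True the local rate would instead be capped so that the effective step never exceeds the one given by the global lr; that branch is omitted here because the setup above uses clip=False.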

Note: I first followed the setting in the paper, which is lr=0.32 (0.02*4096/256), but only got 66.0%. I then increased the learning rate to lr=1.6 (0.1*4096/256) and achieved 67.8%. The results and models are given below.
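
The two learning rates in the note come from the linear scaling rule, lr = base_lr * batch_size / 256. The short sketch below only reproduces that arithmetic; the function name is illustrative and not part of the training scripts.

```python
def scaled_lr(base_lr, batch_size, reference_batch_size=256):
    """Linear scaling rule: the lr grows in proportion to the batch size."""
    return base_lr * batch_size / reference_batch_size


paper_lr = scaled_lr(0.02, 4096)  # setting from the paper
tuned_lr = scaled_lr(0.1, 4096)   # setting that reached 67.8% here
print(paper_lr, tuned_lr)
```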

SimSiam	Pre-training batch size	Linear-eval lr	Top-1 acc
Reported	512	0.32	67.7%
Reproduced	432 (Model)	1.6	67.8% (Model)
Reproduced	432	0.32	66.0%

Acknowledgement

Thanks to Xinlei for his help with some implementation details.

Owner

Taojiannan Yang
