Accuracy Aligned. Concise Implementation of Swin Transformer

Last update: Dec 16, 2022

Related tags

Overview

Accuracy Aligned. Concise Implementation of Swin Transformer

This repository contains the implementation of Swin Transformer, and the training codes on ImageNet datasets. We have aligned the output of our network with the official one, that is, using the same input and random seed, the output is identical to the official one.

Our implementation is highly based on einops, which makes the implementation more concise, and easy to be understand. (Intuitively, we use only 200 lines of codes compared with ~600 lines of official codes.) Besides, our implementation keeps the same training speed.

Model	Epoch	[email protected](our)	[email protected](our)	[email protected](official)	[email protected](official)	pretrained model
Swin-T	300	81.3	95.5	81.2	95.5	here

Usage

Train on ImageNet:

Train Swin-T

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py --model Swin_T \
--batch-size 128 --drop-path 0.2 --data-path ~/ILSVRC2012/ --output_dir /data/SwinTransformer_exp/SwinT/

Train Swin-S

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py --model Swin_S \
--batch-size 128 --drop-path 0.3 --data-path ~/ILSVRC2012/ --output_dir /data/SwinTransformer_exp/SwinS/

Train Swin-B

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py --model Swin_B \
--batch-size 128 --drop-path 0.5 --data-path ~/ILSVRC2012/ --output_dir /data/SwinTransformer_exp/SwinB/

Reference

The training process involves many training and augmentation tricks, such as stochastic depth, mixup, cutmix and random erasing. I borrow large from Deit (https://github.com/facebookresearch/deit).

Citations

@misc{liu2021swin,
      title={Swin Transformer: Hierarchical Vision Transformer using Shifted Windows}, 
      author={Ze Liu and Yutong Lin and Yue Cao and Han Hu and Yixuan Wei and Zheng Zhang and Stephen Lin and Baining Guo},
      year={2021},
      eprint={2103.14030},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Accuracy Aligned. Concise Implementation of Swin Transformer

Related tags

Overview

Accuracy Aligned. Concise Implementation of Swin Transformer

Usage

Reference

Citations

Owner

FengWang

simple artificial intelligence utilities

Art Project "Schrödinger's Game of Life"

AI-Bot - 一个基于watermelon改造的OpenAI-GPT-2的智能机器人

Discriminative Condition-Aware PLDA

Implementation of ConvMixer-Patches Are All You Need? in TensorFlow and Keras

Weakly Supervised Segmentation by Tensorflow.

Repository of best practices for deep learning in Julia, inspired by fastai

EASY - Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients.

Official Pytorch implementation of "DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network" (CVPR'21)

Image Segmentation and Object Detection in Pytorch

Learning hidden low dimensional dyanmics using a Generalized Onsager Principle and neural networks

ConformalLayers: A non-linear sequential neural network with associative layers

Realtime YOLO Monster Detection With Non Maximum Supression

A framework for annotating 3D meshes using the predictions of a 2D semantic segmentation model.

PyTorch implementation of "A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing"

Official PyTorch code of DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization (ICCV 2021 Oral).

FwordCTF 2021 Infrastructure and Source code of Web/Bash challenges

To model the probability of a soccer coach leave his/her team during Campeonato Brasileiro for 10 chosen teams and considering years 2018, 2019 and 2020.

An essential implementation of BYOL in PyTorch + PyTorch Lightning

Benchmark library for high-dimensional HPO of black-box models based on Weighted Lasso regression