SwinTransformerV2-TensorFlow

A TensorFlow implementation of SwinTransformerV2 by Microsoft Research Asia, based on their official implementation of SwinTransformerV1 and their paper on V2.

Paper on Version 2 (18/11/2021): [arXiv]

Paper on Version 1 (17/08/2021): [arXiv]

Features:

TensorFlow 2 implementation of versions 1 and 2 of the SwinTransformer, a state-of-the-art backbone for many contemporary tasks in computer vision. A brief overview of the architectural changes made in version 2:

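A residual post-norm configuration replaces the previous pre-norm configuration: layer normalization is applied to the output of each residual branch before it is added back to the main path, which is meant to improve training stability in larger models.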
Scaled cosine attention replaces the dot-product attention of V1: attention logits are the cosine similarity between queries and keys, divided by a learnable scalar temperature.
A continuous log-spaced relative position bias replaces the previous parametric bias table. It is implemented here as a small MLP applied to log-transformed relative coordinates.
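
As a rough illustration of the attention and position-bias changes listed above, the following NumPy sketch follows the formulas in the V2 paper rather than this repository's TensorFlow code. The function names `scaled_cosine_attention` and `log_spaced_coords` are hypothetical, the temperature `tau` is passed in as a plain float instead of being a trained variable, and the MLP that maps coordinates to a bias is left out.

```python
import numpy as np

def scaled_cosine_attention(q, k, v, tau, bias=None):
    """Single-head attention on (tokens, dim) arrays using cosine similarity.

    tau stands in for the learnable scalar; it is a fixed float in this sketch.
    """
    # L2-normalise rows so the dot product becomes a cosine in [-1, 1].
    qn = q / np.linalg.norm(q, axis=-1, keepdims=True)
    kn = k / np.linalg.norm(k, axis=-1, keepdims=True)
    # The paper keeps tau above 0.01; mirror that with a simple clamp.
    logits = qn @ kn.T / max(tau, 0.01)
    if bias is not None:
        logits = logits + bias  # e.g. a relative position bias
    # Numerically stable softmax over the key axis.
    logits = logits - logits.max(axis=-1, keepdims=True)
    weights = np.exp(logits)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v, weights

def log_spaced_coords(window_size):
    """Relative (dy, dx) offsets inside a window, mapped by sign(x) * log(1 + |x|)."""
    h, w = window_size
    dy = np.arange(-(h - 1), h, dtype=np.float64)
    dx = np.arange(-(w - 1), w, dtype=np.float64)
    grid = np.stack(np.meshgrid(dy, dx, indexing="ij"), axis=-1)  # (2h-1, 2w-1, 2)
    return np.sign(grid) * np.log1p(np.abs(grid))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    q, k, v = (rng.normal(size=(4, 8)) for _ in range(3))
    out, weights = scaled_cosine_attention(q, k, v, tau=0.1)
    print(out.shape, weights.sum(axis=-1))
    print(log_spaced_coords((3, 3)).shape)
```

Because queries and keys are normalised, rescaling either one leaves the attention weights unchanged, which is the property that keeps logits bounded in large models. In the full model the log-spaced coordinates would be fed to the small MLP to produce one bias value per head and offset.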

Requirements:

numpy==1.21.4
tensorflow==2.7.0
tensorflow_addons==0.15.0
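
A quick way to confirm that an environment matches these pins is sketched below. It uses only the standard library (`importlib.metadata`, Python 3.8+). The names `PINNED` and `check_pins` are hypothetical helpers for illustration and are not part of this repository.

```python
from importlib import metadata  # standard library, Python 3.8+

# Pins copied from the requirements list above.
PINNED = {
    "numpy": "1.21.4",
    "tensorflow": "2.7.0",
    "tensorflow_addons": "0.15.0",
}

def check_pins(pins):
    """Return {name: (wanted, found, matches)}; found is None if not installed."""
    report = {}
    for name, wanted in pins.items():
        try:
            found = metadata.version(name)
        except metadata.PackageNotFoundError:
            found = None
        report[name] = (wanted, found, found == wanted)
    return report

if __name__ == "__main__":
    for name, (wanted, found, ok) in check_pins(PINNED).items():
        status = "ok" if ok else "MISMATCH"
        print(f"{name}: wanted {wanted}, found {found} [{status}]")
```

Package names are looked up as installed distributions, so a name that pip registers with a hyphen may need to be written that way on some Python versions.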

Getting started

Currently writing up.

License

This project is licensed under the MIT license.

Citation

@article{liu2021Swin,
  title={Swin Transformer: Hierarchical Vision Transformer using Shifted Windows},
  author={Liu, Ze and Lin, Yutong and Cao, Yue and Hu, Han and Wei, Yixuan and Zhang, Zheng and Lin, Stephen and Guo, Baining},
  journal={arXiv preprint arXiv:2103.14030},
  year={2021}
}

Owner

Phan Nguyen
