SOFT: Softmax-free Transformer with Linear Complexity, NeurIPS 2021 Spotlight

Last update: Dec 25, 2022

Related tags

Overview

SOFT: Softmax-free Transformer with Linear Complexity

SOFT: Softmax-free Transformer with Linear Complexity,
Jiachen Lu, Jinghan Yao, Junge Zhang, Xiatian Zhu, Hang Xu, Weiguo Gao, Chunjing Xu, Tao Xiang, Li Zhang,
NeurIPS 2021 Spotlight

Requirments

timm==0.3.2
torch>=1.7.0 and torchvision that matches the PyTorch installation
cuda>=10.2

Compilation may be fail on cuda < 10.2.
We have compiled it successfully on cuda 10.2 and cuda 11.2.

Data preparation

Download and extract ImageNet train and val images from http://image-net.org/. The directory structure is the standard layout for the torchvision datasets.ImageFolder, and the training and validation data is expected to be in the train/ folder and val folder respectively:

/path/to/imagenet/
  train/
    class1/
      img1.jpeg
    class2/
      img2.jpeg
  val/
    class1/
      img3.jpeg
    class/2
      img4.jpeg

Installation

git clone https://github.com/fudan-zvg/SOFT.git
python -m pip install -e SOFT

Main results

Image Classification

ImageNet-1K

Model	Resolution	Params	FLOPs	Top-1 %	Config
SOFT-Tiny	224	13M	1.9G	79.3	SOFT_Tiny.yaml, SOFT_Tiny_cuda.yaml
SOFT-Small	224	24M	3.3G	82.2	SOFT_Small.yaml, SOFT_Small_cuda.yaml
SOFT-Medium	224	45M	7.2G	82.9	SOFT_Meidum.yaml, SOFT_Meidum_cuda.yaml
SOFT-Large	224	64M	11.0G	83.1	SOFT_Large.yaml, SOFT_Large_cuda.yaml
SOFT-Huge	224	87M	16.3G	83.3	SOFT_Huge.yaml, SOFT_Huge_cuda.yaml

Get Started

Train

We have two implementations of Gaussian Kernel: PyTorch version and the exact form of Gaussian function implemented by cuda. The config file containing cuda is the cuda implementation. Both implementations yield same performance. Please install SOFT before running the cuda version.

./dist_train.sh ${GPU_NUM} --data ${DATA_PATH} --config ${CONFIG_FILE}
# For example, train SOFT-Tiny on Imagenet training dataset with 8 GPUs
./dist_train.sh 8 --data ${DATA_PATH} --config config/SOFT_Tiny.yaml

Test

./dist_train.sh ${GPU_NUM} --data ${DATA_PATH} --config ${CONFIG_FILE} --eval_checkpoint ${CHECKPOINT_FILE} --eval

# For example, test SOFT-Tiny on Imagenet validation dataset with 8 GPUs

./dist_train.sh 8 --data ${DATA_PATH} --config config/SOFT_Tiny.yaml --eval_checkpoint ${CHECKPOINT_FILE} --eval

Reference

@inproceedings{SOFT,
    title={SOFT: Softmax-free Transformer with Linear Complexity}, 
    author={Lu, Jiachen and Yao, Jinghan and Zhang, Junge and Zhu, Xiatian and Xu, Hang and Gao, Weiguo and Xu, Chunjing and Xiang, Tao and Zhang, Li},
    booktitle={NeurIPS},
    year={2021}
}

License

MIT

Acknowledgement

Thanks to previous open-sourced repo:
Detectron2
T2T-ViT
PVT
Nystromformer
pytorch-image-models

SOFT: Softmax-free Transformer with Linear Complexity, NeurIPS 2021 Spotlight

Related tags

Overview

SOFT: Softmax-free Transformer with Linear Complexity

Requirments

Data preparation

Installation

Main results

Image Classification

ImageNet-1K

Get Started

Train

Test

Reference

License

Acknowledgement

Owner

Fudan Zhang Vision Group

Official code of paper: MovingFashion: a Benchmark for the Video-to-Shop Challenge

Le dataset des images du projet d'IA de 2021

Bounding Wasserstein distance with couplings

PyTea: PyTorch Tensor shape error analyzer

Deep learning image registration library for PyTorch

Super-BPD: Super Boundary-to-Pixel Direction for Fast Image Segmentation (CVPR 2020)

A simple, clean TensorFlow implementation of Generative Adversarial Networks with a focus on modeling illustrations.

This is a simple backtesting framework to help you test your crypto currency trading. It includes a way to download and store historical crypto data and to execute a trading strategy.

Mengzi Pretrained Models

IJON is an annotation mechanism that analysts can use to guide fuzzers such as AFL.

A curated list of long-tailed recognition resources.

Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

Official pytorch implementation of the AAAI 2021 paper Semantic Grouping Network for Video Captioning

Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers (arXiv2021)

The official implementation of the paper, "SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning"

Code repository for Semantic Terrain Classification for Off-Road Autonomous Driving

Explicable Reward Design for Reinforcement Learning Agents [NeurIPS'21]

The official PyTorch implementation of Curriculum by Smoothing (NeurIPS 2020, Spotlight).

[内测中]前向式Python环境快捷封装工具，快速将Python打包为EXE并添加CUDA、NoAVX等支持。

FairMOT - A simple baseline for one-shot multi-object tracking