An example of Scatterbrain implementation (combining local attention and Performer)

Last update: Jan 02, 2023

Related tags

Overview

We use the template from https://github.com/ashleve/lightning-hydra-template. Please read the instructions there to understand the repo structure.

Implementation & Experiments

An example of Scatterbrain implementation (combining local attention and Performer) is in the file src/models/modules/attention/sblocal.py.

T2T-ViT inference on ImageNet

To run the T2T-ViT inference on ImageNet experiment:

Download the pretrained weights from the [T2T-ViT repo][https://github.com/yitu-opensource/T2T-ViT/releases]:

mkdir -p checkpoints/t2tvit
cd checkpoints/t2tvit
wget https://github.com/yitu-opensource/T2T-ViT/releases/download/main/81.7_T2T_ViTt_14.pth.tar

Convert the weights to the format compatible with our implementation of T2T-ViT:

# cd to scatterbrain path
python scripts/convert_checkpoint_t2t_vit.py checkpoints/t2tvit/81.7_T2T_ViTt_14.pth.tar

Download the ImageNet dataset (just the validation set will suffice). Below, /path/to/imagenet refers to the directory that contains the train and val directories.
Run the inference experiments:

python run.py experiment=imagenet-t2tvit-eval.yaml model/t2tattn_cfg=full datamodule.data_dir=/path/to/imagenet/ eval.ckpt=checkpoints/t2tvit/81.7_T2T_ViTt_14.pth.tar  # 81.7% acc
python run.py experiment=imagenet-t2tvit-eval.yaml model/t2tattn_cfg=local datamodule.data_dir=/path/to/imagenet/ eval.ckpt=checkpoints/t2tvit/81.7_T2T_ViTt_14.pth.tar  # 80.6% acc
python run.py experiment=imagenet-t2tvit-eval.yaml model/t2tattn_cfg=performer datamodule.data_dir=/path/to/imagenet/ eval.ckpt=checkpoints/t2tvit/81.7_T2T_ViTt_14.pth.tar  # 77.8-79.0% acc (there's randomness)
python run.py experiment=imagenet-t2tvit-eval.yaml model/t2tattn_cfg=sblocal datamodule.data_dir=/path/to/imagenet/ eval.ckpt=checkpoints/t2tvit/81.7_T2T_ViTt_14.pth.tar  # 81.1% acc

Requirements

Python 3.8+, Pytorch 1.9+, torchvision, torchtext, pytorch-fast-transformers, munch, einops, timm, hydra-core, hydra-colorlog, python-dotenv, rich, pytorch-lightning, lightning-bolts.

We provide a Dockerfile that lists all the required packages.

Citation

If you use this codebase, or otherwise found our work valuable, please cite:

@inproceedings{chen2021scatterbrain,
  title={Scatterbrain: Unifying Sparse and Low-rank Attention},
  author={Beidi Chen and Tri Dao and Eric Winsor and Zhao Song and Atri Rudra and Christopher R\'{e}},
  booktitle={Advances in Neural Information Processing Systems (NeurIPS)},
  year={2021}
}

An example of Scatterbrain implementation (combining local attention and Performer)

Related tags

Overview

Implementation & Experiments

T2T-ViT inference on ImageNet

Requirements

Citation

Owner

HazyResearch

PyTorch - Python + Nim

An Image compression simulator that uses Source Extractor and Monte Carlo methods to examine the post compressive effects different compression algorithms have.

WPPNets: Unsupervised CNN Training with Wasserstein Patch Priors for Image Superresolution

Some toy examples of score matching algorithms written in PyTorch

Public repository of the 3DV 2021 paper "Generative Zero-Shot Learning for Semantic Segmentation of 3D Point Clouds"

A human-readable PyTorch implementation of "Self-attention Does Not Need O(n^2) Memory"

RLBot Python bindings for the Rust crate rl_ball_sym

J.A.R.V.I.S is an AI virtual assistant made in python.

Python based framework for Automatic AI for Regression and Classification over numerical data.

这是一个unet-pytorch的源码，可以训练自己的模型

Industrial Image Anomaly Localization Based on Gaussian Clustering of Pre-trained Feature

PHOTONAI is a high level python API for designing and optimizing machine learning pipelines.

Using some basic methods to show linkages and transformations of robotic arms

SuMa++: Efficient LiDAR-based Semantic SLAM (Chen et al IROS 2019)

Roadmap to becoming a machine learning engineer in 2020

Code for ICML 2021 paper: How could Neural Networks understand Programs?

Ppq - A powerful offline neural network quantization tool with custimized IR

Data and Code for paper Outlining and Filling: Hierarchical Query Graph Generation for Answering Complex Questions over Knowledge Graph is available for research purposes.

This package proposes simplified exporting pytorch models to ONNX and TensorRT, and also gives some base interface for model inference.

[AAAI22] Reliable Propagation-Correction Modulation for Video Object Segmentation