Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Last update: Sep 20, 2022

Related tags

Overview

Skyformer

This repository is the official implementation of Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr"om Method (NeurIPS 2021).

Requirements

To install requirements in a conda environment:

conda create -n skyformer python=3.6
conda activate skyformer
pip install -r requirements.txt

Note: Specific requirements for data preprocessing are not included here.

Data Preparation

Processed files can be downloaded here, or processed with the following steps:

Requirements

tensorboard>=2.3.0
tensorflow>=2.3.1
tensorflow-datasets>=4.0.1

Download the TFDS files for pathfinder and then set _PATHFINER_TFDS_PATH to the unzipped directory (following https://github.com/google-research/long-range-arena/issues/11)
Download lra_release.gz (7.7 GB).
Unzip lra-release and put under ./data/.

cd data
wget https://storage.googleapis.com/long-range-arena/lra_release.gz
tar zxvf lra-release.gz

Create a directory lra_processed under ./data/.

mkdir lra_processed
cd ..

6.The directory structure would be (assuming the root dir is code)

./data/lra-processed
./data/long-range-arena-main
./data/lra_release

Create train, dev, and test dataset pickle files for each task.

cd preprocess
python create_pathfinder.py
python create_listops.py
python create_retrieval.py
python create_text.py
python create_cifar10.py

Note: most source code comes from LRA repo.

Run

Modify the configuration in config.py and run

python main.py --mode train --attn skyformer --task lra-text

mode: train, eval
attn: softmax, nystrom, linformer, reformer, perfromer, informer, bigbird, kernelized, skyformer
task: lra-listops, lra-pathfinder, lra-retrieval, lra-text, lra-image

Reference

@inproceedings{Skyformer,
    title={Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method}, 
    author={Yifan Chen and Qi Zeng and Heng Ji and Yun Yang},
    booktitle={NeurIPS},
    year={2021}
}

Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Related tags

Overview

Skyformer

Requirements

Data Preparation

Run

Reference

Owner

Qi Zeng

Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight

BEAMetrics: Benchmark to Evaluate Automatic Metrics in Natural Language Generation

Code for IntraQ, PyTorch implementation of our paper under review

This repository contains the code for the CVPR 2021 paper "GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields"

A universal memory dumper using Frida

Training deep models using anime, illustration images.

pq is a jq-like Pickle file viewer

Contains supplementary materials for reproduce results in HMC divergence time estimation manuscript

통일된 DataScience 폴더 구조 제공 및 가상환경 작업의 부담감 해소

NeuTex: Neural Texture Mapping for Volumetric Neural Rendering

Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".

Code for paper "Vocabulary Learning via Optimal Transport for Neural Machine Translation"

Semantic Segmentation with SegFormer on Drone Dataset.

Rational Activation Functions - Replacing Padé Activation Units

Inhomogeneous Social Recommendation with Hypergraph Convolutional Networks

Decision Transformer: A brand new Offline RL Pattern

Towards Flexible Blind JPEG Artifacts Removal (FBCNN, ICCV 2021)

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it

Generic template to bootstrap your PyTorch project with PyTorch Lightning, Hydra, W&B, and DVC.

Point Cloud Registration using Representative Overlapping Points.