Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Last update: Sep 20, 2022

Related tags

Overview

Skyformer

This repository is the official implementation of Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr"om Method (NeurIPS 2021).

Requirements

To install requirements in a conda environment:

conda create -n skyformer python=3.6
conda activate skyformer
pip install -r requirements.txt

Note: Specific requirements for data preprocessing are not included here.

Data Preparation

Processed files can be downloaded here, or processed with the following steps:

Requirements

tensorboard>=2.3.0
tensorflow>=2.3.1
tensorflow-datasets>=4.0.1

Download the TFDS files for pathfinder and then set _PATHFINER_TFDS_PATH to the unzipped directory (following https://github.com/google-research/long-range-arena/issues/11)
Download lra_release.gz (7.7 GB).
Unzip lra-release and put under ./data/.

cd data
wget https://storage.googleapis.com/long-range-arena/lra_release.gz
tar zxvf lra-release.gz

Create a directory lra_processed under ./data/.

mkdir lra_processed
cd ..

6.The directory structure would be (assuming the root dir is code)

./data/lra-processed
./data/long-range-arena-main
./data/lra_release

Create train, dev, and test dataset pickle files for each task.

cd preprocess
python create_pathfinder.py
python create_listops.py
python create_retrieval.py
python create_text.py
python create_cifar10.py

Note: most source code comes from LRA repo.

Run

Modify the configuration in config.py and run

python main.py --mode train --attn skyformer --task lra-text

mode: train, eval
attn: softmax, nystrom, linformer, reformer, perfromer, informer, bigbird, kernelized, skyformer
task: lra-listops, lra-pathfinder, lra-retrieval, lra-text, lra-image

Reference

@inproceedings{Skyformer,
    title={Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method}, 
    author={Yifan Chen and Qi Zeng and Heng Ji and Yun Yang},
    booktitle={NeurIPS},
    year={2021}
}

Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Related tags

Overview

Skyformer

Requirements

Data Preparation

Run

Reference

Owner

Qi Zeng

GenshinMapAutoMarkTools - Tools To add/delete/refresh resources mark in Genshin Impact Map

Spectral normalization (SN) is a widely-used technique for improving the stability and sample quality of Generative Adversarial Networks (GANs)

Effect of Different Encodings and Distance Functions on Quantum Instance-based Classifiers

Notspot robot simulation - Python version

Python with OpenCV - MediaPip Framework Hand Detection

Employs neural networks to classify images into four categories: ship, automobile, dog or frog

PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud, CVPR 2019.

Deep Learning for Time Series Forecasting.

A two-stage U-Net for high-fidelity denoising of historical recordings

MicroNet: Improving Image Recognition with Extremely Low FLOPs (ICCV 2021)

Semantic Bottleneck Scene Generation

Imposter-detector-2022 - HackED 2022 Team 3IQ - 2022 Imposter Detector

Image Classification - A research on image classification and auto insurance claim prediction, a systematic experiments on modeling techniques and approaches

This script scrapes and stores the availability of timeslots for Car Driving Test at all RTA Serivce NSW centres in the state.

Human Pose estimation with TensorFlow framework

Voice Gender Recognition

Deep Learning Emotion decoding using EEG data from Autism individuals

Official Python implementation of the 'Sparse deconvolution'-v0.3.0

Code for Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing(ICCV21)

[ICLR'21] Counterfactual Generative Networks