Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Last update: Sep 20, 2022

Related tags

Overview

Skyformer

This repository is the official implementation of Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr"om Method (NeurIPS 2021).

Requirements

To install requirements in a conda environment:

conda create -n skyformer python=3.6
conda activate skyformer
pip install -r requirements.txt

Note: Specific requirements for data preprocessing are not included here.

Data Preparation

Processed files can be downloaded here, or processed with the following steps:

Requirements

tensorboard>=2.3.0
tensorflow>=2.3.1
tensorflow-datasets>=4.0.1

Download the TFDS files for pathfinder and then set _PATHFINER_TFDS_PATH to the unzipped directory (following https://github.com/google-research/long-range-arena/issues/11)
Download lra_release.gz (7.7 GB).
Unzip lra-release and put under ./data/.

cd data
wget https://storage.googleapis.com/long-range-arena/lra_release.gz
tar zxvf lra-release.gz

Create a directory lra_processed under ./data/.

mkdir lra_processed
cd ..

6.The directory structure would be (assuming the root dir is code)

./data/lra-processed
./data/long-range-arena-main
./data/lra_release

Create train, dev, and test dataset pickle files for each task.

cd preprocess
python create_pathfinder.py
python create_listops.py
python create_retrieval.py
python create_text.py
python create_cifar10.py

Note: most source code comes from LRA repo.

Run

Modify the configuration in config.py and run

python main.py --mode train --attn skyformer --task lra-text

mode: train, eval
attn: softmax, nystrom, linformer, reformer, perfromer, informer, bigbird, kernelized, skyformer
task: lra-listops, lra-pathfinder, lra-retrieval, lra-text, lra-image

Reference

@inproceedings{Skyformer,
    title={Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method}, 
    author={Yifan Chen and Qi Zeng and Heng Ji and Yun Yang},
    booktitle={NeurIPS},
    year={2021}
}

Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Related tags

Overview

Skyformer

Requirements

Data Preparation

Run

Reference

Owner

Qi Zeng

EfficientMPC - Efficient Model Predictive Control Implementation

Simple tool to combine(merge) onnx models. Simple Network Combine Tool for ONNX.

SWA Object Detection

A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering.

End-to-end beat and downbeat tracking in the time domain.

A Pytorch implement of paper "Anomaly detection in dynamic graphs via transformer" (TADDY).

This is the offical website for paper ''Category-consistent deep network learning for accurate vehicle logo recognition''

Multi-Task Learning as a Bargaining Game

Flexible Networks for Learning Physical Dynamics of Deformable Objects (2021)

Code for "LASR: Learning Articulated Shape Reconstruction from a Monocular Video". CVPR 2021.

DLFlow is a deep learning framework.

PyTorch implementation of InstaGAN: Instance-aware Image-to-Image Translation

StyleGAN of All Trades: Image Manipulation withOnly Pretrained StyleGAN

Code for paper "Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation" EMNLP 2021

Official PyTorch implementation of the paper Image-Based CLIP-Guided Essence Transfer.

The FIRST GANs-based omics-to-omics translation framework

Source code of generalized shuffled linear regression

Code repository for Semantic Terrain Classification for Off-Road Autonomous Driving

Self-Supervised Learning of Event-based Optical Flow with Spiking Neural Networks

Implementation of Perceiver, General Perception with Iterative Attention in TensorFlow