Stochastic Positional Encoding (SPE)

This is the source code repository for the ICML 2021 paper Relative Positional Encoding for Transformers with Linear Complexity by Antoine Liutkus, Ondřej Cífka, Shih-Lun Wu, Umut Şimşekli, Yi-Hsuan Yang and Gaël Richard.

In this paper, we propose Stochastic Positional Encoding (SPE), which provably behaves like relative PE while being compatible with linear-complexity Transformers. We do this by drawing a connection between positional encoding and cross-covariance structures of correlated Gaussian processes.

Check out also the companion website with music examples.

Citation:

@inproceedings{pmlr-v139-liutkus21a,
  title = 	 {Relative Positional Encoding for {Transformers} with Linear Complexity},
  author =       {Liutkus, Antoine and C{\'i}fka, Ond{\v r}ej and Wu, Shih-Lun and {\c S}im{\c s}ekli, Umut and Yang, Yi-Hsuan and Richard, Ga{\"e}l},
  booktitle = 	 {Proceedings of the 38th International Conference on Machine Learning},
  pages = 	 {7067--7079},
  year = 	 {2021},
  editor = 	 {Meila, Marina and Zhang, Tong},
  volume = 	 {139},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {18--24 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v139/liutkus21a/liutkus21a.pdf},
  url = 	 {http://proceedings.mlr.press/v139/liutkus21a.html}
}

SPE implementation

We have implemented SPE in PyTorch and JAX/Flax. Each implementation is available as a separate Python package under src.

Experiments

Each of the 3 experiments (LRA, pop piano generation, groove continuation) has a dedicated directory under experiments. See the README files there for how to set up the environment and prepare the datasets. To make sure you have the custom dependencies for each experiment, clone this repository with --recurse-submodules or run git submodule init && git submodule update after cloning.

Relative Positional Encoding for Transformers with Linear Complexity

Related tags

Overview

Stochastic Positional Encoding (SPE)

SPE implementation

Experiments

Owner

Antoine Liutkus

Learning based AI for playing multi-round Koi-Koi hanafuda card games. Have fun.

Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging

NovelD: A Simple yet Effective Exploration Criterion

CoSMA: Convolutional Semi-Regular Mesh Autoencoder. From Paper "Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes"

Character Grounding and Re-Identification in Story of Videos and Text Descriptions

AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation

Playable Video Generation

Jetson Nano-based smart camera system that measures crowd face mask usage in real-time.

Miscellaneous and lightweight network tools

Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

Posterior predictive distributions quantify uncertainties ignored by point estimates.

Scikit-learn compatible estimation of general graphical models

Code to accompany the paper "Finding Bipartite Components in Hypergraphs", which is published in NeurIPS'21.

A rule learning algorithm for the deduction of syndrome definitions from time series data.

IJCAI2020 & IJCV 2020 :city_sunrise: Unsupervised Scene Adaptation with Memory Regularization in vivo

Explanatory Learning: Beyond Empiricism in Neural Networks

"MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction" (CVPRW 2022) & (Winner of NTIRE 2022 Challenge on Spectral Reconstruction from RGB)

When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset of 53,000+ Legal Holdings

Code for paper entitled "Improving Novelty Detection using the Reconstructions of Nearest Neighbours"