Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

Last update: Nov 15, 2022

Overview

Recurrent Fast Weight Programmers

This is the official repository containing the code we used to produce the experimental results reported in the paper:

Going Beyond Linear Transformers with Recurrent Fast Weight Programmers

algorithmic directory for code execution and ListOps
language_modeling directory for language modeling
reinforcement_learning directory for RL

Separate license files can be found under each directory.

General instructions

Please refer to the readme file in each directory for further instructions.

In all tasks, our custom CUDA kernels will be automatically compiled. To avoid recompiling the code multiple times, we recommend to specify the path to a directory to store the compiled code via:

export TORCH_EXTENSIONS_DIR="/home/me/torch_extensions/lm"

Such a line is already included in the example scripts we provide. Please change the path to a safe directory of your choice.

Important: separate paths should be used for different tasks (i.e. here, one for language modeling, one for code execution, one for ListOps, and one for RL).

BibTex

@article{irie2021going,
      title={Going Beyond Linear Transformers with Recurrent Fast Weight Programmers}, 
      author={Kazuki Irie and Imanol Schlag and R\'obert Csord\'as and J\"urgen Schmidhuber},
      journal={Preprint arXiv:2106.06295},
      year={2021}
}

Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

Related tags

Overview

Recurrent Fast Weight Programmers

Contents

General instructions

BibTex

Links

Owner

IDSIA

Pytorch implementation of paper Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data

Implementation for our ICCV 2021 paper: Dual-Camera Super-Resolution with Aligned Attention Modules

This is a work in progress reimplementation of Instant Neural Graphics Primitives

Misc YOLOL scripts for use in the Starbase space sandbox videogame

Neural Surface Maps

StocksMA is a package to facilitate access to financial and economic data of Moroccan stocks.

A foreign language learning aid using a neural network to predict probability of translating foreign words

A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results

A set of Deep Reinforcement Learning Agents implemented in Tensorflow.

Pytorch implementation of YOLOX、PPYOLO、PPYOLOv2、FCOS an so on.

TensorFlow implementation of "A Simple Baseline for Bayesian Uncertainty in Deep Learning"

A simple python module to generate anchor (aka default/prior) boxes for object detection tasks.

This code provides various models combining dilated convolutions with residual networks

A Traffic Sign Recognition Project which can help the driver recognise the signs via text as well as audio. Can be used at Night also.

Open-CyKG: An Open Cyber Threat Intelligence Knowledge Graph

Share a benchmark that can easily apply reinforcement learning in Job-shop-scheduling

Reinforcement Learning for Automated Trading

Colar: Effective and Efficient Online Action Detection by Consulting Exemplars, CVPR 2022.

Patient-Survival - Using Python, I developed a Machine Learning model using classification techniques such as Random Forest and SVM classifiers to predict a patient's survival status that have undergone breast cancer surgery.

GNN-based Recommendation Benchma