Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

Last update: Nov 15, 2022

Overview

Recurrent Fast Weight Programmers

This is the official repository containing the code we used to produce the experimental results reported in the paper:

Going Beyond Linear Transformers with Recurrent Fast Weight Programmers

Contents

algorithmic: directory for code execution and ListOps
language_modeling: directory for language modeling
reinforcement_learning: directory for reinforcement learning (RL)

Separate license files can be found under each directory.

General instructions

Please refer to the readme file in each directory for further instructions.

In all tasks, our custom CUDA kernels are compiled automatically. To avoid recompiling the code multiple times, we recommend specifying a directory in which to store the compiled code via:

export TORCH_EXTENSIONS_DIR="/home/me/torch_extensions/lm"

Such a line is already included in the example scripts we provide. Please change the path to a safe directory of your choice.

Important: use a separate path for each task (here, one each for language modeling, code execution, ListOps, and RL).
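
As a minimal sketch of the separate-paths recommendation, the snippet below derives one cache directory per task from a common base. The base path, the helper name set_ext_dir, and the task labels are illustrative assumptions; they do not come from the provided example scripts.

```shell
# Hypothetical base directory; change it to a location of your choice.
EXT_BASE="${EXT_BASE:-$HOME/torch_extensions}"

# Hypothetical helper: point TORCH_EXTENSIONS_DIR at a task-specific
# subdirectory so that compiled kernels for different tasks never share a path.
set_ext_dir() {
    task="$1"   # e.g. lm, code_execution, listops, rl (labels are our own)
    export TORCH_EXTENSIONS_DIR="$EXT_BASE/$task"
}

# Select the cache for language modeling before launching that task.
set_ext_dir lm
echo "$TORCH_EXTENSIONS_DIR"
```

Calling set_ext_dir with a different label (for example, set_ext_dir rl) before launching another task keeps each task's compiled code in its own directory.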

BibTeX

@article{irie2021going,
      title={Going Beyond Linear Transformers with Recurrent Fast Weight Programmers}, 
      author={Kazuki Irie and Imanol Schlag and R\'obert Csord\'as and J\"urgen Schmidhuber},
      journal={Preprint arXiv:2106.06295},
      year={2021}
}

Owner

IDSIA
