Official codebase for Pretrained Transformers as Universal Computation Engines.

Last update: Dec 28, 2022

universal-computation

Overview

Official codebase for Pretrained Transformers as Universal Computation Engines. It contains a demo notebook and scripts to reproduce the experiments.

Project Demo

For a minimal demonstration of frozen pretrained transformers, see demo.ipynb. The notebook reproduces the Bit XOR experiment in a couple of minutes and visualizes the learned attention maps.
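To make the Bit XOR task concrete, here is a minimal sketch of how one example could be generated: two random bitstrings as input and their element-wise XOR as the target. The function name, the concatenated input layout, and the choice of `num_bits=5` are illustrative assumptions, not the data format used in demo.ipynb.

```python
import random


def make_bit_xor_example(num_bits, rng):
    """Return (inputs, target) for one Bit XOR example.

    inputs: two random bitstrings of length num_bits, concatenated (assumed layout).
    target: the element-wise XOR of the two bitstrings.
    """
    first = [rng.randint(0, 1) for _ in range(num_bits)]
    second = [rng.randint(0, 1) for _ in range(num_bits)]
    target = [a ^ b for a, b in zip(first, second)]
    return first + second, target


rng = random.Random(0)  # fixed seed so the example is repeatable
inputs, target = make_bit_xor_example(5, rng)
print("inputs:", inputs)
print("target:", target)
```

The model only ever sees `inputs` and is trained to predict `target`, so the task tests whether the network can learn a simple bitwise function.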

Status

The project is released but will receive updates soon.

Currently the repo supports the following tasks:

['bit-memory', 'bit-xor', 'listops', 'mnist', 'cifar10', 'cifar10-gray']

Note that CIFAR-10 LRA is cifar10-gray with a patch size of 1.
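The effect of the patch size can be sketched as follows: a grayscale image is cut into square patches, and each flattened patch becomes one token of the input sequence. With a patch size of 1, every pixel of a 32x32 CIFAR-10 image is its own token, giving a sequence of length 1024. The helper `image_to_patch_sequence` is hypothetical and is not taken from this repo; it only illustrates the idea.

```python
def image_to_patch_sequence(image, patch_size):
    """Split a 2D grayscale image (a list of rows) into flattened square patches.

    Patches are returned in row-major order; each patch is a flat list of
    patch_size * patch_size pixel values.
    """
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image dimensions must be divisible by patch_size")
    sequence = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            patch = [
                image[row][col]
                for row in range(top, top + patch_size)
                for col in range(left, left + patch_size)
            ]
            sequence.append(patch)
    return sequence


# A dummy 32x32 "image" whose pixel value encodes its position.
image = [[row * 32 + col for col in range(32)] for row in range(32)]

pixels = image_to_patch_sequence(image, 1)   # one token per pixel
patches = image_to_patch_sequence(image, 4)  # one token per 4x4 patch

print(len(pixels), len(pixels[0]))    # 1024 1
print(len(patches), len(patches[0]))  # 64 16
```

Smaller patches give longer sequences, which is why the patch-size-1 setting is the long-range variant of the task.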

Usage

Installation

Create the Anaconda environment:
```
$ conda env create -f environment.yml
```
Add universal-computation/ to your PYTHONPATH, i.e. add this line to your ~/.bashrc:
```
export PYTHONPATH=~/universal-computation:$PYTHONPATH
```
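After opening a new shell, a quick way to confirm the PYTHONPATH change took effect is to ask Python whether it can locate the package. This is a small sketch; the module name `universal_computation` is an assumption about how the package directory is named and may need adjusting.

```python
import importlib.util


def is_importable(module_name):
    """Return True if Python can locate a top-level module on the current path."""
    return importlib.util.find_spec(module_name) is not None


# Assumed package name; change it if the directory is named differently.
print(is_importable("universal_computation"))
```

If this prints `False`, check that the exported path matches the location where the repository was cloned.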

Downloading datasets

Datasets are stored in data/. MNIST and CIFAR-10 are downloaded automatically by PyTorch when an experiment starts.

Listops

Download the files for Listops from Long Range Arena. Move the .tsv files into data/listops. There should be three files: basic_test, basic_train, basic_val. The script evaluates on the validation set by default.
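Before launching a Listops run, the expected layout can be checked with a short script such as the one below. It assumes the three files carry a `.tsv` extension (basic_test.tsv, basic_train.tsv, basic_val.tsv); the helper `missing_listops_files` is hypothetical and not part of the repo.

```python
from pathlib import Path

# Assumed file names: the three names listed above plus the .tsv extension.
EXPECTED_LISTOPS_FILES = ("basic_test.tsv", "basic_train.tsv", "basic_val.tsv")


def missing_listops_files(listops_dir):
    """Return the expected Listops files that are not present in listops_dir."""
    listops_dir = Path(listops_dir)
    return [
        name for name in EXPECTED_LISTOPS_FILES
        if not (listops_dir / name).is_file()
    ]


missing = missing_listops_files("data/listops")
if missing:
    print("Missing Listops files:", ", ".join(missing))
else:
    print("All Listops files found.")
```

Run it from the repository root so that the relative path data/listops resolves correctly.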

Remote homology

Support coming soon.

Running experiments

You can run experiments with:

```
python scripts/run.py
```

Adding -w True will log results to Weights and Biases.

Citation

@article{lu2021fpt,
  title={Pretrained Transformers as Universal Computation Engines},
  author={Kevin Lu and Aditya Grover and Pieter Abbeel and Igor Mordatch},
  journal={arXiv preprint arXiv:2103.05247},
  year={2021}
}

Owner

Kevin Lu
