The code for two papers: Feedback Transformer and Expire-Span.

Last update: Dec 25, 2022

Overview

transformer-sequential

This repo contains the code for two papers:

Feedback Transformer
Expire-Span

The training code is structured for modeling long sequences with Transformer-like architectures.

Requirements

You will need a CUDA-enabled GPU to run the code.
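A quick way to check the GPU requirement before launching an experiment is sketched below. It assumes the code base depends on PyTorch (the README does not say so explicitly) and falls back to looking for the `nvidia-smi` driver tool when PyTorch is not installed yet. The function name `cuda_status` is hypothetical and not part of this repo.

```python
import importlib.util
import shutil


def cuda_status() -> str:
    """Return a short description of whether a CUDA GPU looks usable.

    Hypothetical helper, not part of this repo.
    """
    if importlib.util.find_spec("torch") is None:
        # PyTorch is not installed (assumed dependency); look for the
        # NVIDIA driver tool on PATH as a rough hint instead.
        if shutil.which("nvidia-smi"):
            return "nvidia-smi found"
        return "no torch, no nvidia-smi"

    import torch

    if torch.cuda.is_available():
        return "cuda available"
    return "torch installed, no CUDA device"


if __name__ == "__main__":
    print(cuda_status())
```

If this reports anything other than "cuda available" after the setup step, the experiment scripts will most likely not run.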

Setup

Run the following:

pip install -r requirements.txt

Feedback Transformer

Introduced in "Addressing Some Limitations of Transformers with Feedback Memory".

Running Experiments from the Paper

enwik8

Model	Params	Valid	Test
Feedback Transformer	77M	0.984	0.962

Numbers are bits per character (BPC); lower is better.

bash experiments/feedback/enwik8.sh
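The bits-per-character figures above are a base-2 cross-entropy. The sketch below shows how such a figure relates to a loss measured in nats, assuming the training loss is an average natural-log cross-entropy per character (the PyTorch default; how this repo logs its loss is not stated here). The helper names are hypothetical.

```python
import math


def nats_to_bpc(loss_nats: float) -> float:
    """Convert an average per-character loss in nats to bits per character."""
    return loss_nats / math.log(2)


def bpc_to_nats(bpc: float) -> float:
    """Convert bits per character back to nats per character."""
    return bpc * math.log(2)


def bpc_to_perplexity(bpc: float) -> float:
    """Per-character perplexity implied by a bits-per-character value."""
    return 2.0 ** bpc


if __name__ == "__main__":
    # Test BPC reported for the Feedback Transformer on enwik8.
    reported_bpc = 0.962
    print(f"nats per character: {bpc_to_nats(reported_bpc):.4f}")
    print(f"per-character perplexity: {bpc_to_perplexity(reported_bpc):.4f}")
```

A BPC of exactly 1.0 corresponds to a per-character perplexity of 2, so values below 1.0 mean the model needs less than one bit on average to encode each character.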

Algorithmic

Model	3 Variable	5 Variable
Transformer	33.7	37.5
Feedback Transformer	99.1	92.6

Numbers are test accuracy (%).

bash experiments/feedback/algorithmic_3var.sh
bash experiments/feedback/algorithmic_5var.sh

Expire-Span

Introduced in "Not All Memories are Created Equal: Learning to Expire".

Running Experiments from the Paper

enwik8

Model	Params	Valid	Test
Expire-Span 12L	38M	1.014	0.994

Numbers are bits per character (BPC); lower is better.

bash experiments/expire_span/enwik8.sh

Object Collision

Model	Maximum Span	Test Error (%)
Expire-Span	16k	52.2
Expire-Span	32k	36.7
Expire-Span	64k	26.7

bash experiments/expire_span/object_collision_16k.sh
bash experiments/expire_span/object_collision_32k.sh
bash experiments/expire_span/object_collision_64k.sh

License

The code is licensed under the CC-BY-NC license. See the LICENSE file for more details.

Owner

Meta Research
