"Investigating the Limitations of Transformers with Simple Arithmetic Tasks", 2021

Last update: Nov 16, 2022

Related tags

Text Data & NLP transformers-arithmetic

Overview

transformers-arithmetic

This repository contains the code to reproduce the experiments from the paper:

Nogueira, Jiang, Lin "Investigating the Limitations of Transformers with Simple Arithmetic Tasks", 2021

First, install the required packages:

pip install -r requirements.txt

The command below trains and evaluates a T5-base model on the task of adding up to 15-digits:

python main.py \
    --output_dir=. \
    --model_name_or_path=t5-base \
    --operation=addition \
    --orthography=10ebased \
    --balance_train \
    --balance_val \
    --train_size=100000 \
    --val_size=10000 \
    --test_size=10000 \
    --min_digits_train=2 \
    --max_digits_train=15 \
    --min_digits_test=2 \
    --max_digits_test=15 \
    --base_number=10 \
    --seed=1 \
    --train_batch_size=4 \
    --accumulate_grad_batches=32 \
    --val_batch_size=32 \
    --max_seq_length=512 \
    --num_workers=4 \
    --gpus=1 \
    --optimizer=AdamW \
    --lr=3e-4 \
    --weight_decay=5e-5 \
    --scheduler=StepLR \
    --t_0=2 \
    --t_mult=2 \
    --gamma=1.0 \
    --step_size=1000 \
    --max_epochs=20 \
    --check_val_every_n_epoch=2 \
    --amp_level=O0 \
    --precision=32 \
    --gradient_clip_val=1.0

This training should take 10 hours on a V100 GPU.

The exact match on the test set should be 1:

--------------------------------------------------------------------------------
DATALOADER:0 TEST RESULTS
{'test_exact_match': 1.0000}
--------------------------------------------------------------------------------

"Investigating the Limitations of Transformers with Simple Arithmetic Tasks", 2021

Related tags

Overview

transformers-arithmetic

Owner

Castorini

Smart discord chatbot integrated with Dialogflow to manage different classrooms and assist in teaching!

Python functions for summarizing and improving voice dictation input.

An implementation of the Pay Attention when Required transformer

CoSENT、STS、SentenceBERT

An easy to use Natural Language Processing library and framework for predicting, training, fine-tuning, and serving up state-of-the-art NLP models.

Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies

Blender addon - Scrub timeline from viewport with a shortcut

Experiments in converting wikidata to ftm

ADCS cert template modification and ACL enumeration

nlpcommon is a python Open Source Toolkit for text classification.

PyTorch implementation and pretrained models for XCiT models. See XCiT: Cross-Covariance Image Transformer

Few-shot Natural Language Generation for Task-Oriented Dialog

Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization

[ICLR 2021 Spotlight] Pytorch implementation for "Long-tailed Recognition by Routing Diverse Distribution-Aware Experts."

Ecco is a python library for exploring and explaining Natural Language Processing models using interactive visualizations.

Text to speech is a process to convert any text into voice. Text to speech project takes words on digital devices and convert them into audio. Here I have used Google-text-to-speech library popularly known as gTTS library to convert text file to .mp3 file. Hope you like my project!

Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech tagging and word segmentation.

NLP made easy

Tool to check whether a GCP bucket is public or not.

Entity Disambiguation as text extraction (ACL 2022)