Code for the paper A Theoretical Analysis of the Repetition Problem in Text Generation

Last update: Nov 21, 2022

Related tags

Overview

A Theoretical Analysis of the Repetition Problem in Text Generation

This repository share the code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021. The repetition problem has been observed in nearly all text generation models. We theoretically prove that this problem is, unfortunately, caused by the traits of our language itself. There exists too many words predicting the same word as the subsequent word with high probability. Consequently, it is easy to go back to that word and form repetitions. We dub this problem as the high inflow problem. Based on the theoretical analysis, we propose a novel rebalanced encoding approach to alleviate the high inflow problem.

[arXiv]

Requirements

GCC >= 4.8
Python >= 3.7

Install

git clone https://github.com/fuzihaofzh/repetition-problem-nlg.git
cd repetition-problem-nlg
./scripts/setup.sh

iwslt14

Preprocess Data

./scripts/iwslt14_preprocess.sh

Train

./scripts/iwslt14_train.sh iwslt14deen_fastbpe_10000
./scripts/iwslt14_train.sh iwslt14deen_fastbpe_10000_re0.1

Test

./scripts/iwslt14_test.sh

Results can be found in output/eval/*

wiki103

Download the preprocessed data

git clone https://github.com/fuzihaofzh/preprocessed_wiki103.git output/preprocessed/wiki103

This may take few minutes to complete.

Preprocess Data

./scripts/wiki103_preprocess.sh

Train

./scripts/wiki103_train.sh wiki103_fastbpe_10000
./scripts/wiki103_train.sh wiki103_fastbpe_10000_re0.1

Test

./scripts/wiki103_test.sh

Results can be found in output/eval/*

Cite

@inproceedings{fu2020a,
  title={A Theoretical Analysis of the Repetition Problem in Text Generation.},
  author={Fu, Zihao and Lam, Wai and So, Anthony Man-Cho and Shi, Bei },
  booktitle={Thirty-Fifth AAAI Conference on Artificial Intelligence},
  year={2021}
}

Code for the paper A Theoretical Analysis of the Repetition Problem in Text Generation

Related tags

Overview

A Theoretical Analysis of the Repetition Problem in Text Generation

Requirements

Install

iwslt14

Preprocess Data

Train

Test

wiki103

Download the preprocessed data

Preprocess Data

Train

Test

Cite

Owner

Zihao Fu

A model which classifies reviews as positive or negative.

MoCoGAN: Decomposing Motion and Content for Video Generation

This repository contains demos I made with the Transformers library by HuggingFace.

Accelerating BERT Inference for Sequence Labeling via Early-Exit

A package for music online and offline rhythmic information analysis including music Beat, downbeat, tempo and meter tracking.

[ICME 2021 Oral] CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning

[CVPR'21] Locally Aware Piecewise Transformation Fields for 3D Human Mesh Registration

NeurIPS 2021, "Fine Samples for Learning with Noisy Labels"

Deep and online learning with spiking neural networks in Python

SciKit-Learn Laboratory (SKLL) makes it easy to run machine learning experiments.

This is an open solution to the Home Credit Default Risk challenge 🏡

🔥 Real-time Super Resolution enhancement (4x) with content loss and relativistic adversarial optimization 🔥

PyTorch Implementation for Deep Metric Learning Pipelines

An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions

A PyTorch implementation of "Predict then Propagate: Graph Neural Networks meet Personalized PageRank" (ICLR 2019).

Rasterize with the least efforts for researchers.

VGG16 model-based classification project about brain tumor detection.

Codebase for the paper titled "Continual learning with local module selection"

Pytorch implementation for M^3L

OpenFed: A Comprehensive and Versatile Open-Source Federated Learning Framework