Code for the paper A Theoretical Analysis of the Repetition Problem in Text Generation

Last update: Nov 21, 2022

Related tags

Overview

A Theoretical Analysis of the Repetition Problem in Text Generation

This repository share the code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021. The repetition problem has been observed in nearly all text generation models. We theoretically prove that this problem is, unfortunately, caused by the traits of our language itself. There exists too many words predicting the same word as the subsequent word with high probability. Consequently, it is easy to go back to that word and form repetitions. We dub this problem as the high inflow problem. Based on the theoretical analysis, we propose a novel rebalanced encoding approach to alleviate the high inflow problem.

[arXiv]

Requirements

GCC >= 4.8
Python >= 3.7

Install

git clone https://github.com/fuzihaofzh/repetition-problem-nlg.git
cd repetition-problem-nlg
./scripts/setup.sh

iwslt14

Preprocess Data

./scripts/iwslt14_preprocess.sh

Train

./scripts/iwslt14_train.sh iwslt14deen_fastbpe_10000
./scripts/iwslt14_train.sh iwslt14deen_fastbpe_10000_re0.1

Test

./scripts/iwslt14_test.sh

Results can be found in output/eval/*

wiki103

Download the preprocessed data

git clone https://github.com/fuzihaofzh/preprocessed_wiki103.git output/preprocessed/wiki103

This may take few minutes to complete.

Preprocess Data

./scripts/wiki103_preprocess.sh

Train

./scripts/wiki103_train.sh wiki103_fastbpe_10000
./scripts/wiki103_train.sh wiki103_fastbpe_10000_re0.1

Test

./scripts/wiki103_test.sh

Results can be found in output/eval/*

Cite

@inproceedings{fu2020a,
  title={A Theoretical Analysis of the Repetition Problem in Text Generation.},
  author={Fu, Zihao and Lam, Wai and So, Anthony Man-Cho and Shi, Bei },
  booktitle={Thirty-Fifth AAAI Conference on Artificial Intelligence},
  year={2021}
}

Code for the paper A Theoretical Analysis of the Repetition Problem in Text Generation

Related tags

Overview

A Theoretical Analysis of the Repetition Problem in Text Generation

Requirements

Install

iwslt14

Preprocess Data

Train

Test

wiki103

Download the preprocessed data

Preprocess Data

Train

Test

Cite

Owner

Zihao Fu

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with ONNX, TensorRT, ncnn, and OpenVINO supported.

Reference implementation for Deep Unsupervised Learning using Nonequilibrium Thermodynamics

Implementation of Online Label Smoothing in PyTorch

Automatic differentiation with weighted finite-state transducers.

Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks

Make your first PR. A beginner friendly repository made specifically for open source beginners. Add any program under any language (it can be anything from a simple program to a complex data structure algorithm). Happy coding...

Relaxed-machines - explorations in neuro-symbolic differentiable interpreters

Discord Multi Tool that focuses on design and easy usage

Stochastic Normalizing Flows

My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control

Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation （ICCV2021）

(SIGIR2020) “Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback’’

My personal code and solution to the Synacor Challenge from 2012 OSCON.

The ARCA23K baseline system

Code for Efficient Visual Pretraining with Contrastive Detection

A mini-course offered to Undergrad chemistry students

social humanoid robots with GPGPU and IoT

Curated list of awesome GAN applications and demo

Python library to receive live stream events like comments and gifts in realtime from TikTok LIVE.

Tracking Pipeline helps you to solve the tracking problem more easily