A Theoretical Analysis of the Repetition Problem in Text Generation
Zihao Fu, Wai Lam, Anthony Man-Cho So, Bei Shi

TL;DR
This paper provides a theoretical framework analyzing the causes of repetition in text generation, introduces the Average Repetition Probability, and proposes a rebalanced encoding method to reduce repetitions effectively.
Contribution
It introduces a new theoretical framework and the ARP metric, analyzes the causes of repetition, and proposes a novel rebalanced encoding approach to mitigate the problem.
Findings
Repetition is linked to high word prediction overlap.
Existing methods implicitly minimize upper bounds of ARP.
Rebalanced encoding significantly reduces repetition.
Abstract
Text generation tasks, including translation, summarization, language models, and etc. see rapid growth during recent years. Despite the remarkable achievements, the repetition problem has been observed in nearly all text generation models undermining the generation performance extensively. To solve the repetition problem, many methods have been proposed, but there is no existing theoretical analysis to show why this problem happens and how it is resolved. In this paper, we propose a new framework for theoretical analysis for the repetition problem. We first define the Average Repetition Probability (ARP) to characterize the repetition problem quantitatively. Then, we conduct an extensive analysis of the Markov generation model and derive several upper bounds of the average repetition probability with intuitive understanding. We show that most of the existing methods are essentially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
