Collapse of Self-trained Language Models

David Herel; Tomas Mikolov

arXiv:2404.02305·cs.CL·April 4, 2024·1 cites

Collapse of Self-trained Language Models

David Herel, Tomas Mikolov

PDF

Open Access 1 Repo

TL;DR

This paper investigates the effects of self-training language models on their own outputs, revealing that extended self-training causes performance degradation and output collapse, highlighting limitations of this approach.

Contribution

It provides the first systematic analysis of self-training in language models, demonstrating its practical limitations and risks of collapse.

Findings

01

Extended self-training degrades GPT-2 performance.

02

Self-training leads to repetitive, collapsed outputs.

03

Self-training has limited benefits and potential risks.

Abstract

In various fields of knowledge creation, including science, new ideas often build on pre-existing information. In this work, we explore this concept within the context of language models. Specifically, we explore the potential of self-training models on their own outputs, akin to how humans learn and build on their previous thoughts and actions. While this approach is intuitively appealing, our research reveals its practical limitations. We find that extended self-training of the GPT-2 model leads to a significant degradation in performance, resulting in repetitive and collapsed token output.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

davidherel/collapse-lm-iclr
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Weight Decay · Adam · Cosine Annealing · Byte Pair Encoding · Softmax · Discriminative Fine-Tuning