Scaling Laws and Interpretability of Learning from Repeated Data

Danny Hernandez; Tom Brown; Tom Conerly; Nova DasSarma; Dawn Drain,; Sheer El-Showk; Nelson Elhage; Zac Hatfield-Dodds; Tom Henighan; Tristan; Hume; Scott Johnston; Ben Mann; Chris Olah; Catherine Olsson; Dario Amodei,; Nicholas Joseph; Jared Kaplan; Sam McCandlish

arXiv:2205.10487·cs.LG·May 24, 2022·22 cites

Scaling Laws and Interpretability of Learning from Repeated Data

Danny Hernandez, Tom Brown, Tom Conerly, Nova DasSarma, Dawn Drain,, Sheer El-Showk, Nelson Elhage, Zac Hatfield-Dodds, Tom Henighan, Tristan, Hume, Scott Johnston, Ben Mann, Chris Olah, Catherine Olsson, Dario Amodei,, Nicholas Joseph, Jared Kaplan, Sam McCandlish

PDF

Open Access

TL;DR

This paper investigates how repeated data in training large language models causes performance degradation, revealing a double descent phenomenon and linking it to internal model structures like induction heads.

Contribution

It systematically studies the effects of data repetition, demonstrating severe performance degradation and connecting it to mechanistic interpretability insights about model internals.

Findings

01

Repeated data causes a double descent in test loss.

02

Repetition of 0.1% of data can degrade performance to that of a smaller model.

03

Data repetition damages copying mechanisms and induction heads.

Abstract

Recent large language models have been trained on vast datasets, but also often on repeated data, either intentionally for the purpose of upweighting higher quality data, or unintentionally because data deduplication is not perfect and the model is exposed to repeated data at the sentence, paragraph, or document level. Some works have reported substantial negative performance effects of this repeated data. In this paper we attempt to study repeated data systematically and to understand its effects mechanistically. To do this, we train a family of models where most of the data is unique but a small fraction of it is repeated many times. We find a strong double descent phenomenon, in which repeated data can lead test loss to increase midway through training. A predictable range of repetition frequency leads to surprisingly severe degradation in performance. For instance, performance of an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Natural Language Processing Techniques