Emergent and Predictable Memorization in Large Language Models
Stella Biderman, USVSN Sai Prashanth, Lintang Sutawika and, Hailey Schoelkopf, Quentin Anthony, Shivanshu Purohit, Edward Raff

TL;DR
This paper investigates how large language models memorize data, especially sensitive information, and proposes methods to predict memorization behavior early in training to improve safety and reliability.
Contribution
It introduces a scaling law-based approach to forecast memorization in large models from lower-compute trials, aiding safer deployment.
Findings
Memorization scores follow predictable scaling laws.
Early predictions can reliably estimate full-model memorization.
Distribution of memorization varies across models and data.
Abstract
Memorization, or the tendency of large language models (LLMs) to output entire sequences from their training data verbatim, is a key concern for safely deploying language models. In particular, it is vital to minimize a model's memorization of sensitive datapoints such as those containing personal identifiable information (PII). The prevalence of such undesirable memorization can pose issues for model trainers, and may even require discarding an otherwise functional model. We therefore seek to predict which sequences will be memorized before a large model's full train-time by extrapolating the memorization behavior of lower-compute trial runs. We measure memorization of the Pythia model suite and plot scaling laws for forecasting memorization, allowing us to provide equi-compute recommendations to maximize the reliability (recall) of such predictions. We additionally provide further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning in Healthcare
MethodsPythia
