Memory Dial: A Training Framework for Controllable Memorization in Language Models
Xiangbo Zhang, Ali Emami

TL;DR
Memory Dial is a training framework that explicitly controls memorization in language models, enabling systematic study of how memorization affects model behavior and generalization.
Contribution
It introduces a controllable training method that varies memorization pressure, allowing for precise analysis of memorization effects across models and datasets.
Findings
Memorization pressure can be reliably controlled via the parameter α.
Larger models are more sensitive to changes in memorization pressure.
Frequent sequences are easier for models to memorize than rare ones.
Abstract
Memorization in language models is widely studied but remains difficult to isolate and control. Understanding when and what models memorize is essential for explaining their predictions, yet existing approaches are post-hoc: they can detect memorization in trained models, but cannot disentangle its effects from architecture, data, or optimization. We introduce Memory Dial, a training framework that makes memorization pressure an explicit, controllable variable. Memory Dial interpolates between standard cross-entropy and a temperature-sharpened objective via a single parameter , producing a family of models identical in architecture and training setup (within each sweep), differing only in memorization pressure. Experiments across six architectures and five benchmarks demonstrate that: (1) reliably controls memorization pressure, with seen-example accuracy increasing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
