Prior Aware Memorization: An Efficient Metric for Distinguishing Memorization from Generalization in Large Language Models

Trishita Tiwari; Ari Trachtenberg; G. Edward Suh

arXiv:2602.18733·cs.LG·February 24, 2026

Prior Aware Memorization: An Efficient Metric for Distinguishing Memorization from Generalization in Large Language Models

Trishita Tiwari, Ari Trachtenberg, G. Edward Suh

PDF

Open Access

TL;DR

This paper introduces Prior-Aware Memorization, a lightweight, training-free metric to distinguish genuine memorization from common pattern generation in large language models, addressing privacy and security concerns.

Contribution

It proposes a theoretically grounded criterion that effectively differentiates memorized data from statistically common sequences without retraining models.

Findings

01

55% to 90% of sequences labeled as memorized are statistically common

02

Approximately 40% of sequences in the dataset show common-pattern behavior despite low frequency

03

The metric highlights the importance of considering model priors in leakage assessment

Abstract

Training data leakage from Large Language Models (LLMs) raises serious concerns related to privacy, security, and copyright compliance. A central challenge in assessing this risk is distinguishing genuine memorization of training data from the generation of statistically common sequences. Existing approaches to measuring memorization often conflate these phenomena, labeling outputs as memorized even when they arise from generalization over common patterns. Counterfactual Memorization provides a principled solution by comparing models trained with and without a target sequence, but its reliance on retraining multiple baseline models makes it computationally expensive and impractical at scale. This work introduces Prior-Aware Memorization, a theoretically grounded, lightweight and training-free criterion for identifying genuine memorization in LLMs. The key idea is to evaluate whether a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Adversarial Robustness in Machine Learning