Personal Information Parroting in Language Models

Nishant Subramani; Kshitish Ghate; Mona Diab

arXiv:2602.20580·cs.CL·February 25, 2026

Personal Information Parroting in Language Models

Nishant Subramani, Kshitish Ghate, Mona Diab

PDF

Open Access 2 Videos

TL;DR

This paper introduces a new detector suite for personal information in language models and finds that larger models and more pretraining increase memorization, emphasizing the need for better data filtering.

Contribution

The work develops a superior regex-based detector suite for personal information and provides a comprehensive analysis of memorization across various model sizes and training stages.

Findings

01

13.6% of PI is verbatim memorized by Pythia-6.9b

02

Model size and pretraining steps positively correlate with memorization

03

Smallest model parrots 2.7% of PI instances

Abstract

Modern language models (LM) are trained on large scrapes of the Web, containing millions of personal information (PI) instances, many of which LMs memorize, increasing privacy risks. In this work, we develop the regexes and rules (R&R) detector suite to detect email addresses, phone numbers, and IP addresses, which outperforms the best regex-based PI detectors. On a manually curated set of 483 instances of PI, we measure memorization: finding that 13.6% are parroted verbatim by the Pythia-6.9b model, i.e., when the model is prompted with the tokens that precede the PI in the original document, greedy decoding generates the entire PI span exactly. We expand this analysis to study models of varying sizes (160M-6.9B) and pretraining time steps (70k-143k iterations) in the Pythia model suite and find that both model size and amount of pretraining are positively correlated with memorization.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Personal Information Parroting in Language Models· underline

Taxonomy

TopicsAuthorship Attribution and Profiling · Spam and Phishing Detection · Mental Health via Writing