Evaluating LLM-based Personal Information Extraction and Countermeasures

Yupei Liu; Yuqi Jia; Jinyuan Jia; Neil Zhenqiang Gong

arXiv:2408.07291·cs.CR·April 8, 2026

Evaluating LLM-based Personal Information Extraction and Countermeasures

Yupei Liu, Yuqi Jia, Jinyuan Jia, Neil Zhenqiang Gong

PDF

TL;DR

This paper benchmarks large language models for personal information extraction from profiles, demonstrating their effectiveness and proposing prompt injection as a mitigation strategy.

Contribution

It introduces a framework for LLM-based extraction attacks, new datasets, and evaluates countermeasures like prompt injection against these attacks.

Findings

01

LLMs can accurately extract personal info from profiles.

02

LLMs outperform traditional extraction methods.

03

Prompt injection reduces attack effectiveness.

Abstract

Automatically extracting personal information -- such as name, phone number, and email address -- from publicly available profiles at a large scale is a stepstone to many other security attacks including spear phishing. Traditional methods -- such as regular expression, keyword search, and entity detection -- achieve limited success at such personal information extraction. In this work, we perform a systematic measurement study to benchmark large language model (LLM) based personal information extraction and countermeasures. Towards this goal, we present a framework for LLM-based extraction attacks; collect four datasets including a synthetic dataset generated by GPT-4 and three real-world datasets with manually labeled eight categories of personal information; introduce a novel mitigation strategy based on prompt injection; and systematically benchmark LLM-based attacks and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.