Evaluating LLM-based Personal Information Extraction and Countermeasures
Yupei Liu, Yuqi Jia, Jinyuan Jia, Neil Zhenqiang Gong

TL;DR
This paper benchmarks large language models for personal information extraction from profiles, demonstrating their effectiveness and proposing prompt injection as a mitigation strategy.
Contribution
It introduces a framework for LLM-based extraction attacks, new datasets, and evaluates countermeasures like prompt injection against these attacks.
Findings
LLMs can accurately extract personal info from profiles.
LLMs outperform traditional extraction methods.
Prompt injection reduces attack effectiveness.
Abstract
Automatically extracting personal information -- such as name, phone number, and email address -- from publicly available profiles at a large scale is a stepstone to many other security attacks including spear phishing. Traditional methods -- such as regular expression, keyword search, and entity detection -- achieve limited success at such personal information extraction. In this work, we perform a systematic measurement study to benchmark large language model (LLM) based personal information extraction and countermeasures. Towards this goal, we present a framework for LLM-based extraction attacks; collect four datasets including a synthetic dataset generated by GPT-4 and three real-world datasets with manually labeled eight categories of personal information; introduce a novel mitigation strategy based on prompt injection; and systematically benchmark LLM-based attacks and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
