ProPILE: Probing Privacy Leakage in Large Language Models
Siwon Kim, Sangdoo Yun, Hwaran Lee, Martin Gubri, Sungroh Yoon, Seong, Joon Oh

TL;DR
ProPILE is a probing tool that enables data subjects and service providers to assess potential privacy leakage of personally identifiable information in large language models, promoting awareness and control over data privacy.
Contribution
The paper introduces ProPILE, a novel tool for probing and evaluating PII leakage in large language models, facilitating privacy awareness for users and providers.
Findings
ProPILE effectively assesses PII leakage in the OPT-1.3B model.
Data subjects can evaluate their PII exposure using tailored prompts.
Providers can use ProPILE to measure privacy risks in their models.
Abstract
The rapid advancement and widespread use of large language models (LLMs) have raised significant concerns regarding the potential leakage of personally identifiable information (PII). These models are often trained on vast quantities of web-collected data, which may inadvertently include sensitive personal data. This paper presents ProPILE, a novel probing tool designed to empower data subjects, or the owners of the PII, with awareness of potential PII leakage in LLM-based services. ProPILE lets data subjects formulate prompts based on their own PII to evaluate the level of privacy intrusion in LLMs. We demonstrate its application on the OPT-1.3B model trained on the publicly available Pile dataset. We show how hypothetical data subjects may assess the likelihood of their PII being included in the Pile dataset being revealed. ProPILE can also be leveraged by LLM service providers to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Privacy-Preserving Technologies in Data
Methodstravel james
