UnPII: Unlearning Personally Identifiable Information with Quantifiable Exposure Risk
Intae Jeon, Yujeong Kwon, Hyungjoon Koo

TL;DR
UnPII introduces a PII-centric unlearning method that prioritizes forgetting sensitive data based on a comprehensive risk index, improving privacy and utility in large language models.
Contribution
This work presents the first PII-focused unlearning approach using the PII risk index, enabling risk-aware data removal without altering existing unlearning algorithms.
Findings
Achieves up to 11.8% accuracy improvement.
Enhances utility by up to 6.3%.
Reduces exposure risk by up to 12.4%.
Abstract
The ever-increasing adoption of Large Language Models in critical sectors like finance, healthcare, and government raises privacy concerns regarding the handling of sensitive Personally Identifiable Information (PII) during training. In response, regulations such as European Union's General Data Protection Regulation (GDPR) mandate the deletion of PII upon requests, underscoring the need for reliable and cost-effective data removal solutions. Machine unlearning has emerged as a promising direction for selectively forgetting data points. However, existing unlearning techniques typically apply a uniform forgetting strategy that neither accounts for the varying privacy risks posed by different PII attributes nor reflects associated business risks. In this work, we propose UnPII, the first PII-centric unlearning approach that prioritizes forgetting based on the risk of individual or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection · Data Quality and Management
