TL;DR
This paper introduces a framework for adaptive text anonymization that automatically optimizes prompts for language models to balance privacy and utility across various domains and requirements.
Contribution
It proposes a novel task formulation and prompt optimization method enabling flexible, domain-aware anonymization strategies for language models.
Findings
Outperforms existing baselines in privacy-utility trade-offs across five diverse datasets.
Achieves comparable performance to larger closed-source models using open-source models.
Discovers novel anonymization strategies exploring different privacy-utility points.
Abstract
Anonymizing textual documents is a highly context-sensitive problem: the appropriate balance between privacy protection and utility preservation varies with the data domain, privacy objectives, and downstream application. However, existing anonymization methods rely on static, manually designed strategies that lack the flexibility to adjust to diverse requirements and often fail to generalize across domains. We introduce adaptive text anonymization, a new task formulation in which anonymization strategies are automatically adapted to specific privacy-utility requirements. We propose a framework for task-specific prompt optimization that automatically constructs anonymization instructions for language models, enabling adaptation to different privacy goals, domains, and downstream usage patterns. To evaluate our approach, we present a benchmark spanning five datasets with diverse domains,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
