Adaptive Text Anonymization: Learning Privacy-Utility Trade-offs via Prompt Optimization

Gabriel Loiseau; Damien Sileo; Damien Riquet; Maxime Meyer; Marc Tommasi

arXiv:2602.20743·cs.CL·April 21, 2026

Adaptive Text Anonymization: Learning Privacy-Utility Trade-offs via Prompt Optimization

Gabriel Loiseau, Damien Sileo, Damien Riquet, Maxime Meyer, Marc Tommasi

PDF

1 Models

TL;DR

This paper introduces a framework for adaptive text anonymization that automatically optimizes prompts for language models to balance privacy and utility across various domains and requirements.

Contribution

It proposes a novel task formulation and prompt optimization method enabling flexible, domain-aware anonymization strategies for language models.

Findings

01

Outperforms existing baselines in privacy-utility trade-offs across five diverse datasets.

02

Achieves comparable performance to larger closed-source models using open-source models.

03

Discovers novel anonymization strategies exploring different privacy-utility points.

Abstract

Anonymizing textual documents is a highly context-sensitive problem: the appropriate balance between privacy protection and utility preservation varies with the data domain, privacy objectives, and downstream application. However, existing anonymization methods rely on static, manually designed strategies that lack the flexibility to adjust to diverse requirements and often fail to generalize across domains. We introduce adaptive text anonymization, a new task formulation in which anonymization strategies are automatically adapted to specific privacy-utility requirements. We propose a framework for task-specific prompt optimization that automatically constructs anonymization instructions for language models, enabling adaptation to different privacy goals, domains, and downstream usage patterns. To evaluate our approach, we present a benchmark spanning five datasets with diverse domains,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
Dinegonos/slm-rag-anonymization-tram
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.