AgentStealth: Reinforcing Large Language Model for Anonymizing User-generated Text

Chenyang Shao; Tianxing Li; Chenhao Pu; Fengli Xu; Yong Li

arXiv:2506.22508·cs.CL·July 1, 2025

AgentStealth: Reinforcing Large Language Model for Anonymizing User-generated Text

Chenyang Shao, Tianxing Li, Chenhao Pu, Fengli Xu, Yong Li

PDF

Open Access 1 Repo

TL;DR

AgentStealth introduces a self-reinforcing framework for anonymizing user-generated text using locally deployed language models, balancing privacy protection and utility without relying on cloud-based solutions.

Contribution

It proposes a novel self-reinforcing anonymization framework combining adversarial workflows, contrastive learning, and reinforcement learning for effective local text anonymization.

Findings

01

Outperforms baselines in anonymization effectiveness (+12.3%)

02

Enhances utility preservation (+6.8%)

03

Supports deployment on edge devices

Abstract

In today's digital world, casual user-generated content often contains subtle cues that may inadvertently expose sensitive personal attributes. Such risks underscore the growing importance of effective text anonymization to safeguard individual privacy. However, existing methods either rely on rigid replacements that damage utility or cloud-based LLMs that are costly and pose privacy risks. To address these issues, we explore the use of locally deployed smaller-scale language models (SLMs) for anonymization. Yet training effective SLMs remains challenging due to limited high-quality supervision. To address the challenge, we propose AgentStealth, a self-reinforcing LLM anonymization framework.First, we introduce an adversarial anonymization workflow enhanced by In-context Contrastive Learning and Adaptive Utility-Aware Control. Second, we perform supervised adaptation of SLMs using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tsinghua-fib-lab/agentstealth
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection · Adversarial Robustness in Machine Learning

MethodsContrastive Learning