Contextualized Privacy Defense for LLM Agents

Yule Wen; Yanzhe Zhang; Jianxun Lian; Xiaoyuan Yi; Xing Xie; Diyi Yang

arXiv:2603.02983·cs.CR·March 4, 2026

Contextualized Privacy Defense for LLM Agents

Yule Wen, Yanzhe Zhang, Jianxun Lian, Xiaoyuan Yi, Xing Xie, Diyi Yang

PDF

Open Access 1 Datasets

TL;DR

This paper introduces Contextualized Defense Instructing (CDI), a proactive, context-aware privacy defense framework for LLM agents that outperforms traditional static methods in balancing privacy and helpfulness.

Contribution

The paper presents CDI, a novel reinforcement learning-based privacy defense paradigm that generates step-specific guidance, improving privacy protection in LLM agents.

Findings

01

CDI achieves 94.2% privacy preservation.

02

CDI maintains 80.6% helpfulness.

03

CDI shows superior robustness and generalization.

Abstract

LLM agents increasingly act on users' personal information, yet existing privacy defenses remain limited in both design and adaptability. Most prior approaches rely on static or passive defenses, such as prompting and guarding. These paradigms are insufficient for supporting contextual, proactive privacy decisions in multi-step agent execution. We propose Contextualized Defense Instructing (CDI), a new privacy defense paradigm in which an instructor model generates step-specific, context-aware privacy guidance during execution, proactively shaping actions rather than merely constraining or vetoing them. Crucially, CDI is paired with an experience-driven optimization framework that trains the instructor via reinforcement learning (RL), where we convert failure trajectories with privacy violations into learning environments. We formalize baseline defenses and CDI as distinct intervention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

SALT-NLP/Contextualized_Privacy_Defense_Trajectory
dataset· 16 dl
16 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Privacy-Preserving Technologies in Data