Privacy-Aware Decoding: Mitigating Privacy Leakage of Large Language Models in Retrieval-Augmented Generation
Haoran Wang, Xiongxiao Xu, Baixiang Huang, Kai Shu

TL;DR
This paper introduces Privacy-Aware Decoding (PAD), a novel inference-time method that injects calibrated noise into language model outputs to protect sensitive information during retrieval-augmented generation, balancing privacy and utility.
Contribution
PAD is a model-agnostic, efficient decoding strategy that provides explicit differential privacy guarantees without retraining, reducing privacy leakage in RAG systems.
Findings
PAD significantly reduces private information leakage.
PAD maintains high response quality compared to existing defenses.
PAD operates with minimal computational overhead.
Abstract
Retrieval-Augmented Generation (RAG) enhances the factual accuracy of large language models (LLMs) by conditioning outputs on external knowledge sources. However, when retrieval involves private or sensitive data, RAG systems are susceptible to extraction attacks that can leak confidential information through generated responses. We propose Privacy-Aware Decoding (PAD), a lightweight, inference-time defense that adaptively injects calibrated Gaussian noise into token logits during generation. PAD integrates confidence-based screening to selectively protect high-risk tokens, efficient sensitivity estimation to minimize unnecessary noise, and context-aware noise calibration to balance privacy with generation quality. A \renyi Differential Privacy (RDP) accountant rigorously tracks cumulative privacy loss, enabling explicit per-response -DP guarantees for sensitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
