Privacy-Aware Decoding: Mitigating Privacy Leakage of Large Language Models in Retrieval-Augmented Generation

Haoran Wang; Xiongxiao Xu; Baixiang Huang; Kai Shu

arXiv:2508.03098·cs.CL·August 6, 2025

Privacy-Aware Decoding: Mitigating Privacy Leakage of Large Language Models in Retrieval-Augmented Generation

Haoran Wang, Xiongxiao Xu, Baixiang Huang, Kai Shu

PDF

TL;DR

This paper introduces Privacy-Aware Decoding (PAD), a novel inference-time method that injects calibrated noise into language model outputs to protect sensitive information during retrieval-augmented generation, balancing privacy and utility.

Contribution

PAD is a model-agnostic, efficient decoding strategy that provides explicit differential privacy guarantees without retraining, reducing privacy leakage in RAG systems.

Findings

01

PAD significantly reduces private information leakage.

02

PAD maintains high response quality compared to existing defenses.

03

PAD operates with minimal computational overhead.

Abstract

Retrieval-Augmented Generation (RAG) enhances the factual accuracy of large language models (LLMs) by conditioning outputs on external knowledge sources. However, when retrieval involves private or sensitive data, RAG systems are susceptible to extraction attacks that can leak confidential information through generated responses. We propose Privacy-Aware Decoding (PAD), a lightweight, inference-time defense that adaptively injects calibrated Gaussian noise into token logits during generation. PAD integrates confidence-based screening to selectively protect high-risk tokens, efficient sensitivity estimation to minimize unnecessary noise, and context-aware noise calibration to balance privacy with generation quality. A \renyi Differential Privacy (RDP) accountant rigorously tracks cumulative privacy loss, enabling explicit per-response $(ε, δ)$ -DP guarantees for sensitive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.