Privacy-Preserving Retrieval-Augmented Generation with Differential Privacy

Tatsuki Koga; Ruihan Wu; Zhiyuan Zhang; Kamalika Chaudhuri

arXiv:2412.04697·cs.CR·November 13, 2025·2 cites

Privacy-Preserving Retrieval-Augmented Generation with Differential Privacy

Tatsuki Koga, Ruihan Wu, Zhiyuan Zhang, Kamalika Chaudhuri

PDF

Open Access

TL;DR

This paper introduces a differentially private retrieval-augmented generation method that balances privacy and accuracy by selectively allocating privacy budget, enabling safe use of sensitive external data with large language models.

Contribution

It proposes a novel algorithm that efficiently manages privacy budget in RAG, improving accuracy under differential privacy constraints.

Findings

01

Outperforms non-RAG baseline under privacy budget of ε≈10

02

Effective privacy-preserving retrieval for large language models

03

Maintains high accuracy with moderate privacy guarantees

Abstract

With the recent remarkable advancement of large language models (LLMs), there has been a growing interest in utilizing them in the domains with highly sensitive data that lies outside their training data. For this purpose, retrieval-augmented generation (RAG) is particularly effective -- it assists LLMs by directly providing relevant information from the external knowledge sources. However, without extra privacy safeguards, RAG outputs risk leaking sensitive information from the external data source. In this work, we explore RAG under differential privacy (DP), a formal guarantee of data privacy. The main challenge with differentially private RAG is how to generate long accurate answers within a moderate privacy budget. We address this by proposing an algorithm that smartly spends privacy budget only for the tokens that require the sensitive information and uses the non-private LLM for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Stochastic Gradient Optimization Techniques

MethodsAttention Is All You Need · Softmax · Byte Pair Encoding · Linear Layer · Linear Warmup With Linear Decay · Multi-Head Attention · Weight Decay · WordPiece · Layer Normalization · Residual Connection