SD-RAG: A Prompt-Injection-Resilient Framework for Selective Disclosure in Retrieval-Augmented Generation
Aiman Al Masoud, Marco Arazzi, Antonino Nocera

TL;DR
SD-RAG introduces a retrieval-phase security mechanism for RAG systems, enhancing privacy and resilience against prompt injection attacks by applying sanitization and policy-aware retrieval before generation.
Contribution
The paper presents SD-RAG, a novel framework that enforces privacy and security during retrieval, decoupling constraints from generation, and supports fine-grained, policy-aware data retrieval.
Findings
Achieves up to 58% improvement in privacy score.
Demonstrates strong resilience to prompt injection attacks.
Outperforms baseline approaches in experimental evaluations.
Abstract
Retrieval-Augmented Generation (RAG) has attracted significant attention due to its ability to combine the generative capabilities of Large Language Models (LLMs) with knowledge obtained through efficient retrieval mechanisms over large-scale data collections. Currently, the majority of existing approaches overlook the risks associated with exposing sensitive or access-controlled information directly to the generation model. Only a few approaches propose techniques to instruct the generative model to refrain from disclosing sensitive information; however, recent studies have also demonstrated that LLMs remain vulnerable to prompt injection attacks that can override intended behavioral constraints. For these reasons, we propose a novel approach to Selective Disclosure in Retrieval-Augmented Generation, called SD-RAG, which decouples the enforcement of security and privacy constraints…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection · Topic Modeling
