The RAG Paradox: A Black-Box Attack Exploiting Unintentional Vulnerabilities in Retrieval-Augmented Generation Systems
Chanwoo Choi, Jinsoo Kim, Sukmin Cho, Soyeong Jeong, Buru Chang

TL;DR
This paper reveals a structural vulnerability in retrieval-augmented generation systems, where transparency allows attackers to craft poisoned documents that degrade performance while appearing natural, highlighting new security risks.
Contribution
The authors introduce a realistic black-box attack exploiting the RAG paradox, demonstrating how transparency can be used maliciously to undermine RAG system integrity.
Findings
Significantly degrades RAG system performance
Generates natural-looking poisoned documents
Effective in both offline and online settings
Abstract
With the growing adoption of retrieval-augmented generation (RAG) systems, various attack methods have been proposed to degrade their performance. However, most existing approaches rely on unrealistic assumptions in which external attackers have access to internal components such as the retriever. To address this issue, we introduce a realistic black-box attack based on the RAG paradox, a structural vulnerability arising from the system's effort to enhance trust by revealing both the retrieved documents and their sources to users. This transparency enables attackers to observe which sources are used and how information is phrased, allowing them to craft poisoned documents that are more likely to be retrieved and upload them to the identified sources. Moreover, as RAG systems directly provide retrieved content to users, these documents must not only be retrievable but also appear natural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSmart Grid Security and Resilience · Network Security and Intrusion Detection · Security and Verification in Computing
MethodsAttention Is All You Need · Weight Decay · Attention Dropout · Byte Pair Encoding · Dense Connections · Residual Connection · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Linear Decay · WordPiece
