Benchmarking Poisoning Attacks against Retrieval-Augmented Generation
Baolei Zhang, Haoran Xin, Jiatong Li, Dongzhe Zhang, Minghong Fang, Zhuqing Liu, Lihai Nie, Zheli Liu

TL;DR
This paper introduces the first comprehensive benchmark for evaluating poisoning attacks against Retrieval-Augmented Generation (RAG) systems, revealing vulnerabilities across various architectures and the ineffectiveness of current defenses.
Contribution
It presents a broad benchmark framework covering multiple datasets, attack methods, and defenses, and provides a thorough evaluation of poisoning attack effectiveness on diverse RAG architectures.
Findings
Attack effectiveness drops on expanded datasets.
All tested RAG architectures remain vulnerable.
Current defenses are insufficient against poisoning attacks.
Abstract
Retrieval-Augmented Generation (RAG) has proven effective in mitigating hallucinations in large language models by incorporating external knowledge during inference. However, this integration introduces new security vulnerabilities, particularly to poisoning attacks. Although prior work has explored various poisoning strategies, a thorough assessment of their practical threat to RAG systems remains missing. To address this gap, we propose the first comprehensive benchmark framework for evaluating poisoning attacks on RAG. Our benchmark covers 5 standard question answering (QA) datasets and 10 expanded variants, along with 13 poisoning attack methods and 7 defense mechanisms, representing a broad spectrum of existing techniques. Using this benchmark, we conduct a comprehensive evaluation of all included attacks and defenses across the full dataset spectrum. Our findings show that while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Caching and Content Delivery · Algorithms and Data Compression
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Softmax · WordPiece · Weight Decay · Multi-Head Attention · Layer Normalization · Byte Pair Encoding
