PoisonArena: Uncovering Competing Poisoning Attacks in Retrieval-Augmented Generation

Liuji Chen; Xiaofang Yang; Yuanzhuo Lu; Jinghao Zhang; Xin Sun; Qiang Liu; Shu Wu; Jing Dong; Liang Wang

arXiv:2505.12574·cs.IR·June 4, 2025

PoisonArena: Uncovering Competing Poisoning Attacks in Retrieval-Augmented Generation

Liuji Chen, Xiaofang Yang, Yuanzhuo Lu, Jinghao Zhang, Xin Sun, Qiang Liu, Shu Wu, Jing Dong, Liang Wang

PDF

Open Access 1 Repo

TL;DR

PoisonArena introduces a benchmark for evaluating competing poisoning attacks in Retrieval-Augmented Generation systems, revealing the limitations of existing methods and emphasizing the importance of multi-adversary scenarios for robust defense development.

Contribution

This work formalizes a multi-attacker threat model for RAG systems and provides the first benchmark to evaluate competing poisoning attacks in realistic adversarial environments.

Findings

01

Many isolated attack strategies fail under competitive pressure.

02

Traditional metrics like ASR and F1 are insufficient for real-world robustness.

03

PoisonArena enables standardized benchmarking for future defenses.

Abstract

Retrieval-Augmented Generation (RAG) systems, widely used to improve the factual grounding of large language models (LLMs), are increasingly vulnerable to poisoning attacks, where adversaries inject manipulated content into the retriever's corpus. While prior research has predominantly focused on single-attacker settings, real-world scenarios often involve multiple, competing attackers with conflicting objectives. In this work, we introduce PoisonArena, the first benchmark to systematically study and evaluate competing poisoning attacks in RAG. We formalize the multi-attacker threat model, where attackers vie to control the answer to the same query using mutually exclusive misinformation. PoisonArena leverages the Bradley-Terry model to quantify each method's competitive effectiveness in such adversarial environments. Through extensive experiments on the Natural Questions and MS MARCO…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yxf203/poisonarena
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Adversarial Robustness in Machine Learning

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Layer Normalization · Softmax · Attention Dropout · WordPiece · Residual Connection · Linear Layer · Byte Pair Encoding