Poisoning Retrieval Corpora by Injecting Adversarial Passages

Zexuan Zhong; Ziqing Huang; Alexander Wettig; Danqi Chen

arXiv:2310.19156·cs.CL·October 31, 2023·1 cites

Poisoning Retrieval Corpora by Injecting Adversarial Passages

Zexuan Zhong, Ziqing Huang, Alexander Wettig, Danqi Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel adversarial attack on dense retrieval systems, demonstrating that injecting a small number of carefully crafted passages can significantly degrade retrieval accuracy across various datasets and models.

Contribution

The work presents a new attack method that generates adversarial passages to fool dense retrievers, revealing vulnerabilities and generalization capabilities of these systems.

Findings

01

Adversarial passages can mislead retrieval for unseen queries.

02

Attack success rate exceeds 94% on out-of-domain queries.

03

Injecting 500 passages can compromise systems with millions of entries.

Abstract

Dense retrievers have achieved state-of-the-art performance in various information retrieval tasks, but to what extent can they be safely deployed in real-world applications? In this work, we propose a novel attack for dense retrieval systems in which a malicious user generates a small number of adversarial passages by perturbing discrete tokens to maximize similarity with a provided set of training queries. When these adversarial passages are inserted into a large retrieval corpus, we show that this attack is highly effective in fooling these systems to retrieve them for queries that were not seen by the attacker. More surprisingly, these adversarial passages can directly generalize to out-of-domain queries and corpora with a high success attack rate -- for instance, we find that 50 generated passages optimized on Natural Questions can mislead >94% of questions posed in financial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

princeton-nlp/corpus-poisoning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning

MethodsSparse Evolutionary Training