Defense of Adversarial Ranking Attack in Text Retrieval: Benchmark and Baseline via Detection
Xuanang Chen, Ben He, Le Sun, Yingfei Sun

TL;DR
This paper introduces a benchmark and baseline detection methods for defending neural ranking models against adversarial document attacks in text retrieval, highlighting the effectiveness and limitations of supervised classifiers.
Contribution
It establishes a benchmark dataset for adversarial ranking defense and evaluates detection methods, revealing the strengths and weaknesses of supervised classifiers against known and unseen attacks.
Findings
Supervised classifiers effectively detect known adversarial attacks.
Detection performance drops significantly on unseen attacks.
Avoiding query text in classifiers prevents relevance bias.
Abstract
Neural ranking models (NRMs) have undergone significant development and have become integral components of information retrieval (IR) systems. Unfortunately, recent research has unveiled the vulnerability of NRMs to adversarial document manipulations, potentially exploited by malicious search engine optimization practitioners. While progress in adversarial attack strategies aids in identifying the potential weaknesses of NRMs before their deployment, the defensive measures against such attacks, like the detection of adversarial documents, remain inadequately explored. To mitigate this gap, this paper establishes a benchmark dataset to facilitate the investigation of adversarial ranking defense and introduces two types of detection tasks for adversarial documents. A comprehensive investigation of the performance of several detection baselines is conducted, which involve examining the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Misinformation and Its Impacts · Terrorism, Counterterrorism, and Political Violence
