Adversarial Attacks against Neural Ranking Models via In-Context Learning

Amin Bigdeli; Negar Arabzadeh; Ebrahim Bagheri; Charles L. A. Clarke

arXiv:2508.15283·cs.IR·August 22, 2025

Adversarial Attacks against Neural Ranking Models via In-Context Learning

Amin Bigdeli, Negar Arabzadeh, Ebrahim Bagheri, Charles L. A. Clarke

PDF

Open Access

TL;DR

This paper introduces FSAP, a novel black-box adversarial attack framework using in-context learning of LLMs to generate misleading documents that can manipulate neural ranking models, highlighting a significant security threat.

Contribution

The paper presents FSAP, a new prompting-based attack method that does not require model access and can generate effective adversarial documents for neural ranking systems.

Findings

01

FSAP-generated documents outperform credible content in ranking.

02

Adversarial documents exhibit strong stance alignment and low detectability.

03

FSAP generalizes across different LLMs and query types.

Abstract

While neural ranking models (NRMs) have shown high effectiveness, they remain susceptible to adversarial manipulation. In this work, we introduce Few-Shot Adversarial Prompting (FSAP), a novel black-box attack framework that leverages the in-context learning capabilities of Large Language Models (LLMs) to generate high-ranking adversarial documents. Unlike previous approaches that rely on token-level perturbations or manual rewriting of existing documents, FSAP formulates adversarial attacks entirely through few-shot prompting, requiring no gradient access or internal model instrumentation. By conditioning the LLM on a small support set of previously observed harmful examples, FSAP synthesizes grammatically fluent and topically coherent documents that subtly embed false or misleading information and rank competitively against authentic content. We instantiate FSAP in two modes:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Misinformation and Its Impacts