EMPRA: Embedding Perturbation Rank Attack against Neural Ranking Models
Amin Bigdeli, Negar Arabzadeh, Ebrahim Bagheri, Charles L. A. Clarke

TL;DR
EMPRA is a novel adversarial attack method that manipulates sentence embeddings to re-rank documents in neural retrieval systems, effectively promoting target documents without perceptible changes.
Contribution
Introduces EMPRA, a new black-box adversarial attack technique that manipulates embeddings to influence neural ranking models without relying on surrogate models.
Findings
Achieves 96% success in re-ranking target documents into top 10.
Effective against various state-of-the-art neural ranking models.
Does not require surrogate models, increasing attack robustness.
Abstract
Recent research has shown that neural information retrieval techniques may be susceptible to adversarial attacks. Adversarial attacks seek to manipulate the ranking of documents, with the intention of exposing users to targeted content. In this paper, we introduce the Embedding Perturbation Rank Attack (EMPRA) method, a novel approach designed to perform adversarial attacks on black-box Neural Ranking Models (NRMs). EMPRA manipulates sentence-level embeddings, guiding them towards pertinent context related to the query while preserving semantic integrity. This process generates adversarial texts that seamlessly integrate with the original content and remain imperceptible to humans. Our extensive evaluation conducted on the widely-used MS MARCO V1 passage collection demonstrate the effectiveness of EMPRA against a wide range of state-of-the-art baselines in promoting a specific set of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsSparse Evolutionary Training
