Black-box Adversarial Attacks against Dense Retrieval Models: A Multi-view Contrastive Learning Method
Yu-An Liu, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Wei Chen,, Yixing Fan, Xueqi Cheng

TL;DR
This paper introduces a novel black-box adversarial attack method for dense retrieval models, formalizing it as a contrastive learning problem to effectively mislead models with minimal perturbations.
Contribution
It presents the AREA task for attacking dense retrieval models and proposes a contrastive learning-based attack method that outperforms existing strategies.
Findings
Existing NRM attack methods do not generalize well to DR models.
The proposed contrastive learning attack significantly outperforms baseline methods.
Small perturbations can effectively mislead DR models in the proposed framework.
Abstract
Neural ranking models (NRMs) and dense retrieval (DR) models have given rise to substantial improvements in overall retrieval performance. In addition to their effectiveness, and motivated by the proven lack of robustness of deep learning-based approaches in other areas, there is growing interest in the robustness of deep learning-based approaches to the core retrieval problem. Adversarial attack methods that have so far been developed mainly focus on attacking NRMs, with very little attention being paid to the robustness of DR models. In this paper, we introduce the adversarial retrieval attack (AREA) task. The AREA task is meant to trick DR models into retrieving a target document that is outside the initial set of candidate documents retrieved by the DR model in response to a query. We consider the decision-based black-box adversarial setting, which is realistic in real-world search…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Asian Geopolitics and Ethnography
MethodsContrastive Learning · Focus
