RobustMask: Certified Robustness against Adversarial Neural Ranking Attack via Randomized Masking

Jiawei Liu; Zhuo Chen; Rui Zhu; Miaokun Chen; Yuyang Gong; Wei Lu; Xiaofeng Wang

arXiv:2512.23307 · cs.CR · December 30, 2025

TL;DR

RobustMask is a defense that strengthens neural ranking models against character-, word-, and phrase-level adversarial attacks by combining pretrained language models with randomized masking, and it provides certified top-K robustness.

Contribution

The paper introduces RobustMask, a method that provides certified robustness for neural ranking models against character-, word-, and phrase-level adversarial perturbations using randomized masking-based smoothing.

Findings

01. Certifies over 20% of top-10 candidates against adversarial perturbation of up to 30% of their content.

02. Defends against character-, word-, and phrase-level attacks.

03. Provides a theoretical proof of certified top-K robustness.

Abstract

Neural ranking models have achieved remarkable progress and are now widely deployed in real-world applications such as Retrieval-Augmented Generation (RAG). However, like other neural architectures, they remain vulnerable to adversarial manipulations: subtle character-, word-, or phrase-level perturbations can poison retrieval results and artificially promote targeted candidates, undermining the integrity of search engines and downstream systems. Existing defenses either rely on heuristics with poor generalization or on certified methods that assume overly strong adversarial knowledge, limiting their practical use. To address these challenges, we propose RobustMask, a novel defense that combines the context-prediction capability of pretrained language models with a randomized masking-based smoothing mechanism. Our approach strengthens neural ranking models against adversarial…
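The randomized masking-based smoothing idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `[MASK]` placeholder, the 30% mask rate, the sample count, averaging as the aggregation rule, and the toy `toy_overlap_score` ranker are all assumptions made for illustration. A real system would score masked inputs with a pretrained language model-based ranker, and the certification step is not shown here.

```python
import random
from statistics import mean
from typing import Callable, List

MASK = "[MASK]"  # assumed placeholder token, not taken from the paper


def random_mask(tokens: List[str], mask_rate: float, rng: random.Random) -> List[str]:
    """Replace a fixed fraction of tokens, chosen uniformly at random, with MASK."""
    n_mask = int(round(mask_rate * len(tokens)))
    positions = set(rng.sample(range(len(tokens)), n_mask))
    return [MASK if i in positions else tok for i, tok in enumerate(tokens)]


def smoothed_score(
    query: str,
    doc: str,
    score_fn: Callable[[str, str], float],
    mask_rate: float = 0.3,   # assumed value
    n_samples: int = 100,     # assumed value
    seed: int = 0,
) -> float:
    """Average the base ranker's score over many randomly masked copies of doc."""
    rng = random.Random(seed)
    tokens = doc.split()
    samples = [
        score_fn(query, " ".join(random_mask(tokens, mask_rate, rng)))
        for _ in range(n_samples)
    ]
    return mean(samples)


def toy_overlap_score(query: str, doc: str) -> float:
    """Hypothetical stand-in ranker: fraction of document tokens found in the query."""
    q = set(query.lower().split())
    d = doc.lower().split()
    return sum(tok in q for tok in d) / max(len(d), 1)


if __name__ == "__main__":
    doc = "neural ranking models rank documents for a query"
    print(smoothed_score("neural ranking", doc, toy_overlap_score))
```

The intuition behind the sketch is that a perturbation touching only a few tokens is hidden in many of the masked copies, so it can shift the averaged score only by a limited amount.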

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Topic Modeling · Multimodal Machine Learning Applications