Transferability Ranking of Adversarial Examples
Mosh Levy, Guy Amit, Yuval Elovici, Yisroel Mirsky

TL;DR
This paper presents a ranking strategy that predicts the transferability of adversarial examples across models, significantly improving attack success rates without repeated testing on the target system.
Contribution
The proposed method estimates adversarial transferability using surrogate models, reducing the need for trial-and-error testing and increasing attack efficiency.
Findings
Transferability increased from 20% to near 100% in some scenarios.
The strategy predicts success likelihood, enabling better sample selection.
Shared vulnerabilities across diverse models are highlighted.
Abstract
Adversarial transferability in black-box scenarios presents a unique challenge: while attackers can employ surrogate models to craft adversarial examples, they lack assurance on whether these examples will successfully compromise the target model. Until now, the prevalent method to ascertain success has been trial and error-testing crafted samples directly on the victim model. This approach, however, risks detection with every attempt, forcing attackers to either perfect their first try or face exposure. Our paper introduces a ranking strategy that refines the transfer attack process, enabling the attacker to estimate the likelihood of success without repeated trials on the victim's system. By leveraging a set of diverse surrogate models, our method can predict transferability of adversarial examples. This strategy can be used to either select the best sample to use in an attack or the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification
