Your Attack Is Too DUMB: Formalizing Attacker Scenarios for Adversarial Transferability
Marco Alecci, Mauro Conti, Francesco Marchiori, Luca Martinelli, Luca, Pajola

TL;DR
This paper introduces the DUMB attacker model to analyze how differences in dataset sources, model architectures, and data balance affect the transferability of adversarial attacks across machine learning models, revealing significant performance drops under mismatched conditions.
Contribution
The paper proposes the DUMB framework for systematically analyzing transferability of evasion attacks considering realistic differences between surrogate and victim models.
Findings
Mismatches in dataset source reduce attack transferability.
Differences in model architecture impact attack success rates.
Varying data balance levels significantly affect attack transferability.
Abstract
Evasion attacks are a threat to machine learning models, where adversaries attempt to affect classifiers by injecting malicious samples. An alarming side-effect of evasion attacks is their ability to transfer among different models: this property is called transferability. Therefore, an attacker can produce adversarial samples on a custom model (surrogate) to conduct the attack on a victim's organization later. Although literature widely discusses how adversaries can transfer their attacks, their experimental settings are limited and far from reality. For instance, many experiments consider both attacker and defender sharing the same dataset, balance level (i.e., how the ground truth is distributed), and model architecture. In this work, we propose the DUMB attacker model. This framework allows analyzing if evasion attacks fail to transfer when the training conditions of surrogate and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
Methodsfail
