DUMB and DUMBer: Is Adversarial Training Worth It in the Real World?
Francesco Marchiori, Marco Alecci, Luca Pajola, Mauro Conti

TL;DR
This paper evaluates the real-world robustness of adversarially trained models across diverse tasks and attack methods, revealing practical insights into their effectiveness in security-sensitive applications.
Contribution
Introduces DUMBer, a comprehensive attack framework that systematically assesses adversarial training robustness across multiple models, datasets, and attack algorithms.
Findings
Adversarial training effectiveness varies significantly across models and datasets.
Certain defense techniques show limited robustness against advanced attacks.
The evaluation pipeline captures nuanced behaviors of defenses under diverse conditions.
Abstract
Adversarial examples are small and often imperceptible perturbations crafted to fool machine learning models. These attacks seriously threaten the reliability of deep neural networks, especially in security-sensitive domains. Evasion attacks, a form of adversarial attack where input is modified at test time to cause misclassification, are particularly insidious due to their transferability: adversarial examples crafted against one model often fool other models as well. This property, known as adversarial transferability, complicates defense strategies since it enables black-box attacks to succeed without direct access to the victim model. While adversarial training is one of the most widely adopted defense mechanisms, its effectiveness is typically evaluated on a narrow and homogeneous population of models. This limitation hinders the generalizability of empirical findings and restricts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
