Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks
Francesco Croce, Matthias Hein

TL;DR
This paper introduces a parameter-free ensemble of diverse attacks to reliably evaluate adversarial robustness, revealing that many existing defenses are less robust than previously reported.
Contribution
It proposes two extensions to PGD-attack and combines them with existing attacks to form a robust, parameter-free ensemble for evaluating adversarial defenses.
Findings
Most evaluated models are less robust than originally claimed.
The ensemble uncovers broken defenses in recent models.
Evaluation results often show overestimated robustness in prior studies.
Abstract
The field of defense strategies against adversarial attacks has significantly grown over the last years, but progress is hampered as the evaluation of adversarial defenses is often insufficient and thus gives a wrong impression of robustness. Many promising defenses could be broken later on, making it difficult to identify the state-of-the-art. Frequent pitfalls in the evaluation are improper tuning of hyperparameters of the attacks, gradient obfuscation or masking. In this paper we first propose two extensions of the PGD-attack overcoming failures due to suboptimal step size and problems of the objective function. We then combine our novel attacks with two complementary existing ones to form a parameter-free, computationally affordable and user-independent ensemble of attacks to test adversarial robustness. We apply our ensemble to over 50 models from papers published at recent top…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Integrated Circuits and Semiconductor Failure Analysis
