Testing Robustness Against Unforeseen Adversaries
Max Kaufmann, Daniel Kang, Yi Sun, Steven Basart, Xuwang Yin, Mantas, Mazeika, Akul Arora, Adam Dziedzic, Franziska Boenisch, Tom Brown, Jacob, Steinhardt, Dan Hendrycks

TL;DR
This paper introduces ImageNet-UA, a framework to evaluate model robustness against diverse, unforeseen adversarial attacks beyond traditional L_p perturbations, highlighting gaps in current defenses and proposing new methods for improved robustness.
Contribution
The paper presents ImageNet-UA, a new benchmark for testing robustness against diverse, unforeseen attacks, and demonstrates that existing defenses are insufficient against such threats.
Findings
Existing robustness measures fail to predict unforeseen attack resilience.
Standard robustness techniques are outperformed by alternative training strategies.
Novel methods can enhance robustness against diverse, unseen adversaries.
Abstract
Adversarial robustness research primarily focuses on L_p perturbations, and most defenses are developed with identical training-time and test-time adversaries. However, in real-world applications developers are unlikely to have access to the full range of attacks or corruptions their system will face. Furthermore, worst-case inputs are likely to be diverse and need not be constrained to the L_p ball. To narrow in on this discrepancy between research and reality we introduce ImageNet-UA, a framework for evaluating model robustness against a range of unforeseen adversaries, including eighteen new non-L_p attacks. To perform well on ImageNet-UA, defenses must overcome a generalization gap and be robust to a diverse attacks not encountered during training. In extensive experiments, we find that existing robustness measures do not capture unforeseen robustness, that standard robustness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Integrated Circuits and Semiconductor Failure Analysis
