Testing Robustness Against Unforeseen Adversaries

Max Kaufmann; Daniel Kang; Yi Sun; Steven Basart; Xuwang Yin; Mantas; Mazeika; Akul Arora; Adam Dziedzic; Franziska Boenisch; Tom Brown; Jacob; Steinhardt; Dan Hendrycks

arXiv:1908.08016·cs.LG·October 31, 2023·82 cites

Testing Robustness Against Unforeseen Adversaries

Max Kaufmann, Daniel Kang, Yi Sun, Steven Basart, Xuwang Yin, Mantas, Mazeika, Akul Arora, Adam Dziedzic, Franziska Boenisch, Tom Brown, Jacob, Steinhardt, Dan Hendrycks

PDF

Open Access 3 Repos

TL;DR

This paper introduces ImageNet-UA, a framework to evaluate model robustness against diverse, unforeseen adversarial attacks beyond traditional L_p perturbations, highlighting gaps in current defenses and proposing new methods for improved robustness.

Contribution

The paper presents ImageNet-UA, a new benchmark for testing robustness against diverse, unforeseen attacks, and demonstrates that existing defenses are insufficient against such threats.

Findings

01

Existing robustness measures fail to predict unforeseen attack resilience.

02

Standard robustness techniques are outperformed by alternative training strategies.

03

Novel methods can enhance robustness against diverse, unseen adversaries.

Abstract

Adversarial robustness research primarily focuses on L_p perturbations, and most defenses are developed with identical training-time and test-time adversaries. However, in real-world applications developers are unlikely to have access to the full range of attacks or corruptions their system will face. Furthermore, worst-case inputs are likely to be diverse and need not be constrained to the L_p ball. To narrow in on this discrepancy between research and reality we introduce ImageNet-UA, a framework for evaluating model robustness against a range of unforeseen adversaries, including eighteen new non-L_p attacks. To perform well on ImageNet-UA, defenses must overcome a generalization gap and be robust to a diverse attacks not encountered during training. In extensive experiments, we find that existing robustness measures do not capture unforeseen robustness, that standard robustness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Integrated Circuits and Semiconductor Failure Analysis