The Effects of Image Distribution and Task on Adversarial Robustness
Owen Kunhardt, Arturo Deza, Tomaso Poggio

TL;DR
This paper introduces an adapted AUC metric to evaluate adversarial robustness across different models, image distributions, and tasks, revealing how these factors influence model robustness and learned representations.
Contribution
We propose a new AUC-based metric for unbiased comparison of adversarial robustness across models, distributions, and tasks, and analyze its implications on learned representations.
Findings
CIFAR-10 models are less robust than MNIST models.
Image distribution and task significantly affect adversarial robustness.
Pretraining on different distributions/tasks can influence robustness transfer.
Abstract
In this paper, we propose an adaptation to the area under the curve (AUC) metric to measure the adversarial robustness of a model over a particular -interval (interval of adversarial perturbation strengths) that facilitates unbiased comparisons across models when they have different initial performance. This can be used to determine how adversarially robust a model is to different image distributions or task (or some other variable); and/or to measure how robust a model is comparatively to other models. We used this adversarial robustness metric on models of an MNIST, CIFAR-10, and a Fusion dataset (CIFAR-10 + MNIST) where trained models performed either a digit or object recognition task using a LeNet, ResNet50, or a fully connected network (FullyConnectedNet) architecture and found the following: 1) CIFAR-10 models are inherently less…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Integrated Circuits and Semiconductor Failure Analysis
