How many perturbations break this model? Evaluating robustness beyond adversarial accuracy
Raphael Olivier, Bhiksha Raj

TL;DR
This paper introduces adversarial sparsity, a new metric for evaluating neural network robustness that captures the difficulty of finding successful perturbations beyond traditional adversarial accuracy, providing deeper insights and guiding robustness improvements.
Contribution
It proposes adversarial sparsity as an alternative robustness measure, revealing differences between models and defenses that accuracy metrics miss, and demonstrates its utility in assessing robustness enhancements.
Findings
Sparsity reveals differences between robust models not seen with accuracy.
Broken defenses can be distinguished by their sparsity profiles.
Data augmentation can improve robustness without affecting accuracy.
Abstract
Robustness to adversarial attacks is typically evaluated with adversarial accuracy. While essential, this metric does not capture all aspects of robustness and in particular leaves out the question of how many perturbations can be found for each point. In this work, we introduce an alternative approach, adversarial sparsity, which quantifies how difficult it is to find a successful perturbation given both an input point and a constraint on the direction of the perturbation. We show that sparsity provides valuable insight into neural networks in multiple ways: for instance, it illustrates important differences between current state-of-the-art robust models them that accuracy analysis does not, and suggests approaches for improving their robustness. When applying broken defenses effective against weak attacks but not strong ones, sparsity can discriminate between the totally ineffective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
