How many perturbations break this model? Evaluating robustness beyond   adversarial accuracy

Raphael Olivier; Bhiksha Raj

arXiv:2207.04129·cs.LG·August 14, 2023

How many perturbations break this model? Evaluating robustness beyond adversarial accuracy

Raphael Olivier, Bhiksha Raj

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces adversarial sparsity, a new metric for evaluating neural network robustness that captures the difficulty of finding successful perturbations beyond traditional adversarial accuracy, providing deeper insights and guiding robustness improvements.

Contribution

It proposes adversarial sparsity as an alternative robustness measure, revealing differences between models and defenses that accuracy metrics miss, and demonstrates its utility in assessing robustness enhancements.

Findings

01

Sparsity reveals differences between robust models not seen with accuracy.

02

Broken defenses can be distinguished by their sparsity profiles.

03

Data augmentation can improve robustness without affecting accuracy.

Abstract

Robustness to adversarial attacks is typically evaluated with adversarial accuracy. While essential, this metric does not capture all aspects of robustness and in particular leaves out the question of how many perturbations can be found for each point. In this work, we introduce an alternative approach, adversarial sparsity, which quantifies how difficult it is to find a successful perturbation given both an input point and a constraint on the direction of the perturbation. We show that sparsity provides valuable insight into neural networks in multiple ways: for instance, it illustrates important differences between current state-of-the-art robust models them that accuracy analysis does not, and suggests approaches for improving their robustness. When applying broken defenses effective against weak attacks but not strong ones, sparsity can discriminate between the totally ineffective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

raphaelolivier/sparsity
pytorchOfficial

Videos

How Many Perturbations Break This Model? Evaluating Robustness Beyond Adversarial Accuracy· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications