Minimally distorted Adversarial Examples with a Fast Adaptive Boundary   Attack

Francesco Croce; Matthias Hein

arXiv:1907.02044·cs.LG·July 21, 2020·121 cites

Minimally distorted Adversarial Examples with a Fast Adaptive Boundary Attack

Francesco Croce, Matthias Hein

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces a fast, adaptive white-box adversarial attack that efficiently finds minimally distorted examples across multiple norms, outperforming or matching state-of-the-art methods and resisting gradient masking effects.

Contribution

The paper presents a novel attack method with geometric intuition that quickly generates high-quality, minimally perturbed adversarial examples for multiple norms, improving robustness evaluation.

Findings

01

Performs better or similar to existing attacks across $l_p$-norms.

02

Efficiently computes minimal perturbations with a single run.

03

Resistant to gradient masking phenomena.

Abstract

The evaluation of robustness against adversarial manipulation of neural networks-based classifiers is mainly tested with empirical attacks as methods for the exact computation, even when available, do not scale to large networks. We propose in this paper a new white-box adversarial attack wrt the $l_{p}$ -norms for $p \in {1, 2, \infty}$ aiming at finding the minimal perturbation necessary to change the class of a given input. It has an intuitive geometric meaning, yields quickly high quality results, minimizes the size of the perturbation (so that it returns the robust accuracy at every threshold with a single run). It performs better or similar to state-of-the-art attacks which are partially specialized to one $l_{p}$ -norm, and is robust to the phenomenon of gradient masking.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Minimally distorted Adversarial Examples with a Fast Adaptive Boundary Attack· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Bacillus and Francisella bacterial research