AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples

Antonio Emanuele Cin\`a; J\'er\^ome Rony; Maura Pintor; Luca Demetrio; Ambra Demontis; Battista Biggio; Ismail Ben Ayed; Fabio Roli

arXiv:2404.19460·cs.LG·May 13, 2025·2 cites

AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples

Antonio Emanuele Cin\`a, J\'er\^ome Rony, Maura Pintor, Luca Demetrio, Ambra Demontis, Battista Biggio, Ismail Ben Ayed, Fabio Roli

PDF

Open Access 1 Video

TL;DR

AttackBench is a comprehensive evaluation framework that fairly compares gradient-based adversarial attacks by standardizing experimental conditions and measuring effectiveness and efficiency.

Contribution

This work introduces AttackBench, the first framework for fair, standardized comparison of gradient-based adversarial attacks, addressing evaluation biases and implementation issues.

Findings

01

Few attacks outperform all others across configurations.

02

Implementation issues hinder attack effectiveness.

03

AttackBench enables consistent, fair evaluation.

Abstract

Adversarial examples are typically optimized with gradient-based attacks. While novel attacks are continuously proposed, each is shown to outperform its predecessors using different experimental setups, hyperparameter settings, and number of forward and backward calls to the target models. This provides overly-optimistic and even biased evaluations that may unfairly favor one particular attack over the others. In this work, we aim to overcome these limitations by proposing AttackBench, i.e., the first evaluation framework that enables a fair comparison among different attacks. To this end, we first propose a categorization of gradient-based attacks, identifying their main components and differences. We then introduce our framework, which evaluates their effectiveness and efficiency. We measure these characteristics by (i) defining an optimality metric that quantifies how close an attack…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples· underline

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Security and Verification in Computing