Evaluating the Evaluators: Trust in Adversarial Robustness Tests

Antonio Emanuele Cin\`a; Maura Pintor; Luca Demetrio; Ambra Demontis; Battista Biggio; and Fabio Roli

arXiv:2507.03450·cs.CR·July 8, 2025

Evaluating the Evaluators: Trust in Adversarial Robustness Tests

Antonio Emanuele Cin\`a, Maura Pintor, Luca Demetrio, Ambra Demontis, Battista Biggio, and Fabio Roli

PDF

TL;DR

This paper introduces AttackBench, a benchmark framework that standardizes and improves the reliability of adversarial attack evaluations, helping to ensure more trustworthy robustness assessments.

Contribution

The paper presents AttackBench, a comprehensive benchmark for evaluating gradient-based adversarial attacks under consistent conditions, enhancing evaluation reliability and reproducibility.

Findings

01

AttackBench ranks attack implementations based on a novel optimality metric.

02

It enforces standardized testing conditions for fair comparisons.

03

The framework supports continuous updates for evolving robustness evaluation.

Abstract

Despite significant progress in designing powerful adversarial evasion attacks for robustness verification, the evaluation of these methods often remains inconsistent and unreliable. Many assessments rely on mismatched models, unverified implementations, and uneven computational budgets, which can lead to biased results and a false sense of security. Consequently, robustness claims built on such flawed testing protocols may be misleading and give a false sense of security. As a concrete step toward improving evaluation reliability, we present AttackBench, a benchmark framework developed to assess the effectiveness of gradient-based attacks under standardized and reproducible conditions. AttackBench serves as an evaluation tool that ranks existing attack implementations based on a novel optimality metric, which enables researchers and practitioners to identify the most reliable and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.