Towards Million-Scale Adversarial Robustness Evaluation With Stronger   Individual Attacks

Yong Xie; Weijie Zheng; Hanxun Huang; Guangnan Ye; Xingjun; Ma

arXiv:2411.15210·cs.LG·March 12, 2025

Towards Million-Scale Adversarial Robustness Evaluation With Stronger Individual Attacks

Yong Xie, Weijie Zheng, Hanxun Huang, Guangnan Ye, Xingjun, Ma

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces a novel attack method, PMA, for evaluating adversarial robustness of image classifiers, and conducts the first million-scale robustness evaluation revealing significant insights into model vulnerabilities.

Contribution

It proposes the Probability Margin Attack (PMA), a new individual attack method, and performs the first large-scale million-image adversarial robustness evaluation.

Findings

01

PMA outperforms existing individual attack methods.

02

Ensemble attacks can be more effective and efficient.

03

Large-scale evaluation reveals robustness gaps in models.

Abstract

As deep learning models are increasingly deployed in safety-critical applications, evaluating their vulnerabilities to adversarial perturbations is essential for ensuring their reliability and trustworthiness. Over the past decade, a large number of white-box adversarial robustness evaluation methods (i.e., attacks) have been proposed, ranging from single-step to multi-step methods and from individual to ensemble methods. Despite these advances, challenges remain in conducting meaningful and comprehensive robustness evaluations, particularly when it comes to large-scale testing and ensuring evaluations reflect real-world adversarial risks. In this work, we focus on image classification models and propose a novel individual attack method, Probability Margin Attack (PMA), which defines the adversarial margin in the probability space rather than the logits space. We analyze the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xieyong0/PMA
pytorch

Datasets

xingjunm/CC1M-Adv-F
dataset· 80 dl
80 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Fault Detection and Control Systems

MethodsFocus