Adversarial Samples Are Not Created Equal

Jennifer Crawford; Amol Khanna; Fred Lu; Amy R. Wagoner; Stella Biderman; Andre T. Nguyen; Edward Raff

arXiv:2601.00577·cs.LG·January 5, 2026

Adversarial Samples Are Not Created Equal

Jennifer Crawford, Amol Khanna, Fred Lu, Amy R. Wagoner, Stella Biderman, Andre T. Nguyen, Edward Raff

PDF

Open Access

TL;DR

This paper distinguishes two types of adversarial samples based on their use of non-robust features, proposing a new metric to analyze their differences and re-evaluating existing robustness phenomena.

Contribution

It introduces an ensemble-based metric to differentiate adversarial samples that exploit non-robust features from those that do not, offering a new perspective on adversarial robustness evaluation.

Findings

01

Differentiates two types of adversarial samples based on feature usage.

02

Re-examines the impact of sharpness-aware minimization on robustness.

03

Analyzes the robustness gap between adversarial training and standard training.

Abstract

Over the past decade, numerous theories have been proposed to explain the widespread vulnerability of deep neural networks to adversarial evasion attacks. Among these, the theory of non-robust features proposed by Ilyas et al. has been widely accepted, showing that brittle but predictive features of the data distribution can be directly exploited by attackers. However, this theory overlooks adversarial samples that do not directly utilize these features. In this work, we advocate that these two kinds of samples - those which use use brittle but predictive features and those that do not - comprise two types of adversarial weaknesses and should be differentiated when evaluating adversarial robustness. For this purpose, we propose an ensemble-based metric to measure the manipulation of non-robust features by adversarial perturbations and use this metric to analyze the makeup of adversarial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)