Adversarial Robustness Limits via Scaling-Law and Human-Alignment   Studies

Brian R. Bartoldson; James Diffenderfer; Konstantinos Parasyris,; Bhavya Kailkhura

arXiv:2404.09349·cs.LG·July 11, 2024·2 cites

Adversarial Robustness Limits via Scaling-Law and Human-Alignment Studies

Brian R. Bartoldson, James Diffenderfer, Konstantinos Parasyris,, Bhavya Kailkhura

PDF

Open Access 1 Repo

TL;DR

This paper develops scaling laws for adversarial robustness in image classifiers, revealing efficiency gaps in current methods, predicting a robustness plateau near 90%, and highlighting human performance limits due to attack constraints.

Contribution

It introduces the first scaling laws for adversarial training, identifies inefficiencies in prior methods, and predicts a robustness ceiling around 90% with implications for future research.

Findings

01

Scaling laws reveal inefficiencies in prior adversarial training methods.

02

Achieved 74% AutoAttack accuracy with compute-efficient models.

03

Predicted robustness plateau near 90%, matching human performance limits.

Abstract

This paper revisits the simple, long-studied, yet still unsolved problem of making image classifiers robust to imperceptible perturbations. Taking CIFAR10 as an example, SOTA clean accuracy is about $100$ %, but SOTA robustness to $ℓ_{\infty}$ -norm bounded perturbations barely exceeds $70$ %. To understand this gap, we analyze how model size, dataset size, and synthetic data quality affect robustness by developing the first scaling laws for adversarial training. Our scaling laws reveal inefficiencies in prior art and provide actionable feedback to advance the field. For instance, we discovered that SOTA methods diverge notably from compute-optimal setups, using excess compute for their level of robustness. Leveraging a compute-efficient setup, we surpass the prior SOTA with $20$ % ( $70$ %) fewer training (inference) FLOPs. We trained various compute-efficient models, with our best…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bbartoldson/adversarial-robustness-limits
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · COVID-19 diagnosis using AI