Adversarial Robustness Limits via Scaling-Law and Human-Alignment Studies
Brian R. Bartoldson, James Diffenderfer, Konstantinos Parasyris,, Bhavya Kailkhura

TL;DR
This paper develops scaling laws for adversarial robustness in image classifiers, revealing efficiency gaps in current methods, predicting a robustness plateau near 90%, and highlighting human performance limits due to attack constraints.
Contribution
It introduces the first scaling laws for adversarial training, identifies inefficiencies in prior methods, and predicts a robustness ceiling around 90% with implications for future research.
Findings
Scaling laws reveal inefficiencies in prior adversarial training methods.
Achieved 74% AutoAttack accuracy with compute-efficient models.
Predicted robustness plateau near 90%, matching human performance limits.
Abstract
This paper revisits the simple, long-studied, yet still unsolved problem of making image classifiers robust to imperceptible perturbations. Taking CIFAR10 as an example, SOTA clean accuracy is about %, but SOTA robustness to -norm bounded perturbations barely exceeds %. To understand this gap, we analyze how model size, dataset size, and synthetic data quality affect robustness by developing the first scaling laws for adversarial training. Our scaling laws reveal inefficiencies in prior art and provide actionable feedback to advance the field. For instance, we discovered that SOTA methods diverge notably from compute-optimal setups, using excess compute for their level of robustness. Leveraging a compute-efficient setup, we surpass the prior SOTA with % (%) fewer training (inference) FLOPs. We trained various compute-efficient models, with our best…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · COVID-19 diagnosis using AI
