$\ell_\infty$-Robustness and Beyond: Unleashing Efficient Adversarial Training
Hadi M. Dolatabadi, Sarah Erfani, Christopher Leckie

TL;DR
This paper introduces a data subset selection method based on coreset theory to accelerate various adversarial training techniques, achieving 2-3 times faster training with minimal accuracy loss.
Contribution
It presents a general, principled approach to reduce adversarial training time by selecting small data subsets, applicable to multiple training objectives beyond $ ext{l}_ ext{infty}$ attacks.
Findings
Speeds up adversarial training by 2-3 times.
Applicable to TRADES, $ ext{l}_p$-PGD, and PAT.
Maintains comparable accuracy with slight reductions.
Abstract
Neural networks are vulnerable to adversarial attacks: adding well-crafted, imperceptible perturbations to their input can modify their output. Adversarial training is one of the most effective approaches in training robust models against such attacks. However, it is much slower than vanilla training of neural networks since it needs to construct adversarial examples for the entire training data at every iteration, hampering its effectiveness. Recently, Fast Adversarial Training (FAT) was proposed that can obtain robust models efficiently. However, the reasons behind its success are not fully understood, and more importantly, it can only train robust models for -bounded attacks as it uses FGSM during training. In this paper, by leveraging the theory of coreset selection, we show how selecting a small subset of training data provides a general, more principled approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Stochastic Gradient Optimization Techniques
