Subspace Adversarial Training
Tao Li, Yingwen Wu, Sizhe Chen, Kun Fang, Xiaolin Huang

TL;DR
This paper introduces Subspace Adversarial Training (Sub-AT), a novel method that constrains adversarial training within a carefully extracted subspace to prevent overfitting and significantly improve robustness against strong attacks.
Contribution
The paper proposes a new adversarial training approach, Sub-AT, which controls gradient growth by constraining training in a subspace, effectively resolving overfitting and boosting robustness.
Findings
Achieves over 51% robust accuracy against PGD-50 attack on CIFAR-10.
Effectively prevents catastrophic overfitting in single-step adversarial training.
Provides state-of-the-art robustness with computational efficiency.
Abstract
Single-step adversarial training (AT) has received wide attention as it proved to be both efficient and robust. However, a serious problem of catastrophic overfitting exists, i.e., the robust accuracy against projected gradient descent (PGD) attack suddenly drops to 0% during the training. In this paper, we approach this problem from a novel perspective of optimization and firstly reveal the close link between the fast-growing gradient of each sample and overfitting, which can also be applied to understand robust overfitting in multi-step AT. To control the growth of the gradient, we propose a new AT method, Subspace Adversarial Training (Sub-AT), which constrains AT in a carefully extracted subspace. It successfully resolves both kinds of overfitting and significantly boosts the robustness. In subspace, we also allow single-step AT with larger steps and larger radius, further improving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Anomaly Detection Techniques and Applications
