Strength-Adaptive Adversarial Training
Chaojian Yu, Dawei Zhou, Li Shen, Jun Yu, Bo Han, Mingming Gong,, Nannan Wang, Tongliang Liu

TL;DR
Strength-Adaptive Adversarial Training (SAAT) dynamically adjusts attack strength during training, improving robustness and balancing natural accuracy and robustness better than fixed-budget methods.
Contribution
SAAT introduces an adaptive perturbation mechanism based on adversarial loss constraints, addressing robustness disparity and overfitting issues in traditional adversarial training.
Findings
SAAT enhances adversarial robustness across various models.
SAAT effectively balances natural accuracy and robustness.
SAAT reduces robust overfitting in adversarial training.
Abstract
Adversarial training (AT) is proved to reliably improve network's robustness against adversarial data. However, current AT with a pre-specified perturbation budget has limitations in learning a robust network. Firstly, applying a pre-specified perturbation budget on networks of various model capacities will yield divergent degree of robustness disparity between natural and robust accuracies, which deviates from robust network's desideratum. Secondly, the attack strength of adversarial training data constrained by the pre-specified perturbation budget fails to upgrade as the growth of network robustness, which leads to robust overfitting and further degrades the adversarial robustness. To overcome these limitations, we propose \emph{Strength-Adaptive Adversarial Training} (SAAT). Specifically, the adversary employs an adversarial loss constraint to generate adversarial training data.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
