Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective
Zhaoxin Wang, Handing Wang, Cong Tian, Yaochu Jin

TL;DR
This paper introduces FGSM-PCO, a novel fast adversarial training method that prevents catastrophic overfitting by using historical adversarial examples and an adaptive fusion mechanism, improving robustness on complex tasks.
Contribution
The paper proposes FGSM-PCO, a new FAT approach that mitigates overfitting by incorporating historical AEs and adaptive mechanisms within a bi-level optimization framework.
Findings
FGSM-PCO effectively prevents catastrophic overfitting.
It improves robustness across multiple models and datasets.
Empirical results outperform existing FAT algorithms.
Abstract
Adversarial training (AT) has become an effective defense method against adversarial examples (AEs) and it is typically framed as a bi-level optimization problem. Among various AT methods, fast AT (FAT), which employs a single-step attack strategy to guide the training process, can achieve good robustness against adversarial attacks at a low cost. However, FAT methods suffer from the catastrophic overfitting problem, especially on complex tasks or with large-parameter models. In this work, we propose a FAT method termed FGSM-PCO, which mitigates catastrophic overfitting by averting the collapse of the inner optimization problem in the bi-level optimization process. FGSM-PCO generates current-stage AEs from the historical AEs and incorporates them into the training process using an adaptive mechanism. This mechanism determines an appropriate fusion ratio according to the performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsManufacturing Process and Optimization · Integrated Circuits and Semiconductor Failure Analysis · Industrial Vision Systems and Defect Detection
