On the Convergence and Robustness of Adversarial Training
Yisen Wang, Xingjun Ma, James Bailey, Jinfeng Yi, Bowen Zhou, Quanquan, Gu

TL;DR
This paper introduces a new criterion, FOSC, to evaluate the convergence of adversarial examples in training, and proposes a dynamic strategy that improves neural network robustness by adjusting adversarial example quality over training stages.
Contribution
It proposes FOSC as a novel measure for convergence quality in adversarial training and develops a dynamic training method that enhances robustness by controlling adversarial example quality.
Findings
FOSC effectively measures convergence quality of adversarial examples.
Using higher-quality adversarial examples later in training improves robustness.
The proposed dynamic strategy outperforms static approaches in robustness metrics.
Abstract
Improving the robustness of deep neural networks (DNNs) to adversarial examples is an important yet challenging problem for secure deep learning. Across existing defense techniques, adversarial training with Projected Gradient Decent (PGD) is amongst the most effective. Adversarial training solves a min-max optimization problem, with the \textit{inner maximization} generating adversarial examples by maximizing the classification loss, and the \textit{outer minimization} finding model parameters by minimizing the loss on adversarial examples generated from the inner maximization. A criterion that measures how well the inner maximization is solved is therefore crucial for adversarial training. In this paper, we propose such a criterion, namely First-Order Stationary Condition for constrained optimization (FOSC), to quantitatively evaluate the convergence quality of adversarial examples…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Integrated Circuits and Semiconductor Failure Analysis
