On the Convergence and Robustness of Adversarial Training

Yisen Wang; Xingjun Ma; James Bailey; Jinfeng Yi; Bowen Zhou; Quanquan; Gu

arXiv:2112.08304·cs.LG·April 26, 2022·183 cites

On the Convergence and Robustness of Adversarial Training

Yisen Wang, Xingjun Ma, James Bailey, Jinfeng Yi, Bowen Zhou, Quanquan, Gu

PDF

Open Access

TL;DR

This paper introduces a new criterion, FOSC, to evaluate the convergence of adversarial examples in training, and proposes a dynamic strategy that improves neural network robustness by adjusting adversarial example quality over training stages.

Contribution

It proposes FOSC as a novel measure for convergence quality in adversarial training and develops a dynamic training method that enhances robustness by controlling adversarial example quality.

Findings

01

FOSC effectively measures convergence quality of adversarial examples.

02

Using higher-quality adversarial examples later in training improves robustness.

03

The proposed dynamic strategy outperforms static approaches in robustness metrics.

Abstract

Improving the robustness of deep neural networks (DNNs) to adversarial examples is an important yet challenging problem for secure deep learning. Across existing defense techniques, adversarial training with Projected Gradient Decent (PGD) is amongst the most effective. Adversarial training solves a min-max optimization problem, with the \textit{inner maximization} generating adversarial examples by maximizing the classification loss, and the \textit{outer minimization} finding model parameters by minimizing the loss on adversarial examples generated from the inner maximization. A criterion that measures how well the inner maximization is solved is therefore crucial for adversarial training. In this paper, we propose such a criterion, namely First-Order Stationary Condition for constrained optimization (FOSC), to quantitatively evaluate the convergence quality of adversarial examples…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Integrated Circuits and Semiconductor Failure Analysis