Adversarial Training for Gradient Descent: Analysis Through its   Continuous-time Approximation

Haotian Gu; Xin Guo; Xinyu Li

arXiv:2105.08037·cs.LG·May 25, 2023

Adversarial Training for Gradient Descent: Analysis Through its Continuous-time Approximation

Haotian Gu, Xin Guo, Xinyu Li

PDF

Open Access

TL;DR

This paper introduces a continuous-time approximation for adversarial training, providing analytical insights into its robustness and differences from standard gradient descent, supported by theoretical and numerical evidence.

Contribution

It offers a novel continuous-time framework to analyze adversarial training, enhancing understanding of its robustness compared to traditional gradient descent methods.

Findings

01

Adversarial training can be approximated by a continuous-time gradient flow.

02

The analysis confirms the robustness of adversarial training.

03

Numerical examples support the theoretical findings.

Abstract

Adversarial training has gained great popularity as one of the most effective defenses for deep neural network and more generally for gradient-based machine learning models against adversarial perturbations on data points. This paper establishes a continuous-time approximation for the mini-max game of adversarial training. This approximation approach allows for precise and analytical comparisons between stochastic gradient descent and its adversarial training counterpart; and confirms theoretically the robustness of adversarial training from a new gradient-flow viewpoint. The analysis is then corroborated through various analytical and numerical examples.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Model Reduction and Neural Networks