Robust Deep Learning as Optimal Control: Insights and Convergence   Guarantees

Jacob H. Seidman; Mahyar Fazlyab; Victor M. Preciado; George J. Pappas

arXiv:2005.00616·math.OC·May 5, 2020·1 cites

Robust Deep Learning as Optimal Control: Insights and Convergence Guarantees

Jacob H. Seidman, Mahyar Fazlyab, Victor M. Preciado, George J. Pappas

PDF

Open Access

TL;DR

This paper analyzes the convergence of a robust adversarial training method by framing it as an optimal control problem, offering theoretical guarantees and insights into hyperparameter effects, supported by experimental validation.

Contribution

It provides the first convergence analysis of an adversarial training algorithm using optimal control and inexact oracle methods, enhancing understanding of its stability and efficiency.

Findings

01

Convergence guarantees depend on hyperparameter choices.

02

Optimal control perspective improves training efficiency.

03

Experimental results validate theoretical insights.

Abstract

The fragility of deep neural networks to adversarially-chosen inputs has motivated the need to revisit deep learning algorithms. Including adversarial examples during training is a popular defense mechanism against adversarial attacks. This mechanism can be formulated as a min-max optimization problem, where the adversary seeks to maximize the loss function using an iterative first-order algorithm while the learner attempts to minimize it. However, finding adversarial examples in this way causes excessive computational overhead during training. By interpreting the min-max problem as an optimal control problem, it has recently been shown that one can exploit the compositional structure of neural networks in the optimization problem to improve the training time significantly. In this paper, we provide the first convergence analysis of this adversarial training algorithm by combining…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms