You Only Propagate Once: Accelerating Adversarial Training via Maximal Principle

Dinghuai Zhang; Tianyuan Zhang; Yiping Lu; Zhanxing Zhu; Bin Dong

arXiv:1905.00877 · stat.ML · November 4, 2019 · 80 cites

TL;DR

This paper introduces YOPO, an adversarial training method that cuts computational cost by confining most of the forward and backward propagation during adversary updates to the first layer of the network, while keeping robustness comparable to standard adversarial training.

Contribution

The paper formulates adversarial training as a discrete-time differential game and analyzes it through Pontryagin's Maximal Principle (PMP). The resulting algorithm, YOPO, reduces the number of full forward and backward passes needed during adversary updates.
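The idea of keeping adversary updates inside the first layer can be illustrated with a minimal sketch. It is not the authors' implementation: the scalar toy model, the function names, the sign-gradient step, and the clipping to `[-eps, eps]` are all assumptions made for illustration. One full forward/backward pass produces the gradient of the loss with respect to the first-layer output (`p`), which is then held fixed while the perturbation is updated several times using only the first layer.

```python
import math

# Hypothetical toy model (an assumption, not from the paper):
#   first layer: z = tanh(w0 * x)
#   rest of net: out = w1 * z
#   loss:        0.5 * (out - y) ** 2

def first_layer(x, w0):
    return math.tanh(w0 * x)

def loss(x, y, w0, w1):
    return 0.5 * (w1 * first_layer(x, w0) - y) ** 2

def sign(v):
    return (v > 0) - (v < 0)

def yopo_style_perturbation(x, y, w0, w1, eps, alpha, m, n):
    """Return (eta, full_passes) after m outer and n inner steps."""
    eta = 0.0
    full_passes = 0
    for _ in range(m):
        # One full forward/backward pass through the whole network:
        # p is d(loss)/d(first-layer output) at the current perturbation.
        z = first_layer(x + eta, w0)
        p = (w1 * z - y) * w1
        full_passes += 1
        for _ in range(n):
            # Inner steps touch only the first layer; p stays frozen.
            z_in = first_layer(x + eta, w0)
            grad_eta = p * w0 * (1.0 - z_in ** 2)
            # Sign-gradient ascent, then clip back into the eps-ball.
            eta = max(-eps, min(eps, eta + alpha * sign(grad_eta)))
    return eta, full_passes

if __name__ == "__main__":
    eta, passes = yopo_style_perturbation(
        x=0.5, y=1.0, w0=1.0, w1=1.0, eps=0.1, alpha=0.02, m=2, n=3
    )
    print("perturbation:", eta, "full passes:", passes)
    print("clean loss:", loss(0.5, 1.0, 1.0, 1.0))
    print("perturbed loss:", loss(0.5 + eta, 1.0, 1.0, 1.0))
```

In this sketch the perturbation receives `m * n` updates but the full network is traversed only `m` times, which is where the saving would come from in a deep network whose later layers dominate the cost.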

Findings

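01. YOPO achieves robustness comparable to PGD adversarial training while using roughly one quarter to one fifth of the GPU time.

02. Restricting most of the propagation during adversary updates to the first layer greatly reduces computational overhead.

03. The speedup does not come at the cost of defense accuracy, which stays comparable to that of PGD adversarial training.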

Abstract

Deep learning achieves state-of-the-art results in many tasks in computer vision and natural language processing. However, recent works have shown that deep networks can be vulnerable to adversarial perturbations, which raised a serious robustness issue of deep networks. Adversarial training, typically formulated as a robust optimization problem, is an effective way of improving the robustness of deep networks. A major drawback of existing adversarial training algorithms is the computational overhead of the generation of adversarial examples, typically far greater than that of the network training. This leads to the unbearable overall computational cost of adversarial training. In this paper, we show that adversarial training can be cast as a discrete time differential game. Through analyzing the Pontryagin's Maximal Principle (PMP) of the problem, we observe that the adversary update…
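For reference, the robust optimization problem mentioned in the abstract is commonly written as a min-max objective. The notation below is an assumption, since the page does not define any symbols: $\theta$ denotes the network parameters, $\eta$ the adversarial perturbation, $\epsilon$ the perturbation budget, $\ell$ the loss, and $\mathcal{D}$ the data distribution.

```latex
\min_{\theta} \; \mathbb{E}_{(x,y)\sim\mathcal{D}}
\left[ \max_{\|\eta\| \le \epsilon} \ell\bigl(f_{\theta}(x + \eta),\, y\bigr) \right]
```

The inner maximization is the adversary's search for a worst-case perturbation, and the outer minimization is ordinary network training on the perturbed inputs. The computational overhead described in the abstract comes from solving the inner problem for every training batch.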

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications