Learning to Defend by Learning to Attack

Haoming Jiang; Zhehui Chen; Yuyang Shi; Bo Dai; and Tuo Zhao

arXiv:1811.01213·cs.LG·May 4, 2021·30 cites

Learning to Defend by Learning to Attack

Haoming Jiang, Zhehui Chen, Yuyang Shi, Bo Dai, and Tuo Zhao

PDF

Open Access

TL;DR

This paper introduces a novel adversarial training approach using a learnable optimizer within a bilevel optimization framework, improving robustness and efficiency in neural network training.

Contribution

It proposes a learning-to-learn framework that replaces hand-designed inner optimization algorithms with a neural network-based optimizer for adversarial training.

Findings

01

Outperforms existing methods in accuracy on CIFAR datasets

02

Enhances computational efficiency of adversarial training

03

Extends to generative adversarial imitation learning

Abstract

Adversarial training provides a principled approach for training robust neural networks. From an optimization perspective, adversarial training is essentially solving a bilevel optimization problem. The leader problem is trying to learn a robust classifier, while the follower problem is trying to generate adversarial samples. Unfortunately, such a bilevel problem is difficult to solve due to its highly complicated structure. This work proposes a new adversarial training method based on a generic learning-to-learn (L2L) framework. Specifically, instead of applying existing hand-designed algorithms for the inner problem, we learn an optimizer, which is parametrized as a convolutional neural network. At the same time, a robust classifier is learned to defense the adversarial attack generated by the learned optimizer. Experiments over CIFAR-10 and CIFAR-100 datasets demonstrate that L2L…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications