Adversarial Distributional Training for Robust Deep Learning

Yinpeng Dong; Zhijie Deng; Tianyu Pang; Hang Su; Jun Zhu

arXiv:2002.05999·cs.LG·November 20, 2020·34 cites

Adversarial Distributional Training for Robust Deep Learning

Yinpeng Dong, Zhijie Deng, Tianyu Pang, Hang Su, Jun Zhu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces adversarial distributional training (ADT), a new robust learning framework that models adversarial examples as distributions, improving robustness against unseen attacks through a minimax approach and flexible distribution parameterization.

Contribution

The paper proposes ADT, a novel distribution-based adversarial training method formulated as a minimax problem, with algorithms for different distribution parameterizations, enhancing robustness over traditional methods.

Findings

01

ADT outperforms state-of-the-art adversarial training methods on multiple benchmarks.

02

Theoretical analysis supports the effectiveness of the distributional approach.

03

Flexible distribution parameterizations improve robustness against diverse attacks.

Abstract

Adversarial training (AT) is among the most effective techniques to improve model robustness by augmenting training data with adversarial examples. However, most existing AT methods adopt a specific attack to craft adversarial examples, leading to the unreliable robustness against other unseen attacks. Besides, a single attack algorithm could be insufficient to explore the space of perturbations. In this paper, we introduce adversarial distributional training (ADT), a novel framework for learning robust models. ADT is formulated as a minimax optimization problem, where the inner maximization aims to learn an adversarial distribution to characterize the potential adversarial examples around a natural one under an entropic regularizer, and the outer minimization aims to train robust models by minimizing the expected loss over the worst-case adversarial distributions. Through a theoretical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dongyp13/Adversarial-Distributional-Training
pytorchOfficial

Videos

Adversarial Distributional Training for Robust Deep Learning· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications