Adversarial Distributional Training for Robust Deep Learning
Yinpeng Dong, Zhijie Deng, Tianyu Pang, Hang Su, Jun Zhu

TL;DR
This paper introduces adversarial distributional training (ADT), a new robust learning framework that models adversarial examples as distributions, improving robustness against unseen attacks through a minimax approach and flexible distribution parameterization.
Contribution
The paper proposes ADT, a novel distribution-based adversarial training method formulated as a minimax problem, with algorithms for different distribution parameterizations, enhancing robustness over traditional methods.
Findings
ADT outperforms state-of-the-art adversarial training methods on multiple benchmarks.
Theoretical analysis supports the effectiveness of the distributional approach.
Flexible distribution parameterizations improve robustness against diverse attacks.
Abstract
Adversarial training (AT) is among the most effective techniques to improve model robustness by augmenting training data with adversarial examples. However, most existing AT methods adopt a specific attack to craft adversarial examples, leading to the unreliable robustness against other unseen attacks. Besides, a single attack algorithm could be insufficient to explore the space of perturbations. In this paper, we introduce adversarial distributional training (ADT), a novel framework for learning robust models. ADT is formulated as a minimax optimization problem, where the inner maximization aims to learn an adversarial distribution to characterize the potential adversarial examples around a natural one under an entropic regularizer, and the outer minimization aims to train robust models by minimizing the expected loss over the worst-case adversarial distributions. Through a theoretical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
