Improving the Generalization of Adversarial Training with Domain Adaptation
Chuanbiao Song, Kun He, Liwei Wang, John E. Hopcroft

TL;DR
This paper introduces ATDA, a novel adversarial training method using domain adaptation to improve model robustness and generalization against various adversarial attacks, especially FGSM and iterative methods.
Contribution
Proposes ATDA, a domain adaptation-based adversarial training approach that enhances generalization across different attack types and improves robustness of deep learning models.
Findings
ATDA significantly outperforms existing methods on benchmark datasets.
The method improves model smoothness and robustness against multiple attack types.
Extension to iterative attacks like PGD further enhances defense performance.
Abstract
By injecting adversarial examples into training data, adversarial training is promising for improving the robustness of deep learning models. However, most existing adversarial training approaches are based on a specific type of adversarial attack. It may not provide sufficiently representative samples from the adversarial domain, leading to a weak generalization ability on adversarial examples from other attacks. Moreover, during the adversarial training, adversarial perturbations on inputs are usually crafted by fast single-step adversaries so as to scale to large datasets. This work is mainly focused on the adversarial training yet efficient FGSM adversary. In this scenario, it is difficult to train a model with great generalization due to the lack of representative adversarial samples, aka the samples are unable to accurately reflect the adversarial domain. To alleviate this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications
