Improving the Generalization of Adversarial Training with Domain   Adaptation

Chuanbiao Song; Kun He; Liwei Wang; John E. Hopcroft

arXiv:1810.00740·cs.LG·March 18, 2019·47 cites

Improving the Generalization of Adversarial Training with Domain Adaptation

Chuanbiao Song, Kun He, Liwei Wang, John E. Hopcroft

PDF

Open Access 2 Repos

TL;DR

This paper introduces ATDA, a novel adversarial training method using domain adaptation to improve model robustness and generalization against various adversarial attacks, especially FGSM and iterative methods.

Contribution

Proposes ATDA, a domain adaptation-based adversarial training approach that enhances generalization across different attack types and improves robustness of deep learning models.

Findings

01

ATDA significantly outperforms existing methods on benchmark datasets.

02

The method improves model smoothness and robustness against multiple attack types.

03

Extension to iterative attacks like PGD further enhances defense performance.

Abstract

By injecting adversarial examples into training data, adversarial training is promising for improving the robustness of deep learning models. However, most existing adversarial training approaches are based on a specific type of adversarial attack. It may not provide sufficiently representative samples from the adversarial domain, leading to a weak generalization ability on adversarial examples from other attacks. Moreover, during the adversarial training, adversarial perturbations on inputs are usually crafted by fast single-step adversaries so as to scale to large datasets. This work is mainly focused on the adversarial training yet efficient FGSM adversary. In this scenario, it is difficult to train a model with great generalization due to the lack of representative adversarial samples, aka the samples are unable to accurately reflect the adversarial domain. To alleviate this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications