Adv-4-Adv: Thwarting Changing Adversarial Perturbations via Adversarial Domain Adaptation
Tianyue Zheng, Zhe Chen, Shuya Ding, Chao Cai, Jun Luo

TL;DR
Adv-4-Adv introduces a novel adversarial training approach that uses adversarial domain adaptation to improve robustness against unseen adversarial attacks by learning domain-invariant features.
Contribution
It proposes a new adversarial training method that treats different attack types as domains and employs domain adaptation to enhance attack generalization.
Findings
Model trained with Adv-4-Adv generalizes better to unseen attacks.
Outperforms state-of-the-art methods on multiple datasets.
Effective against simple and advanced adversarial attacks.
Abstract
Whereas adversarial training can be useful against specific adversarial perturbations, they have also proven ineffective in generalizing towards attacks deviating from those used for training. However, we observe that this ineffectiveness is intrinsically connected to domain adaptability, another crucial issue in deep learning for which adversarial domain adaptation appears to be a promising solution. Consequently, we proposed Adv-4-Adv as a novel adversarial training method that aims to retain robustness against unseen adversarial perturbations. Essentially, Adv-4-Adv treats attacks incurring different perturbations as distinct domains, and by leveraging the power of adversarial domain adaptation, it aims to remove the domain/attack-specific features. This forces a trained model to learn a robust domain-invariant representation, which in turn enhances its generalization ability.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Cardiac Arrest and Resuscitation
