Towards Out-of-Distribution Adversarial Robustness

Adam Ibrahim; Charles Guille-Escuret; Ioannis Mitliagkas; Irina Rish,; David Krueger; Pouya Bashivan

arXiv:2210.03150·cs.LG·June 27, 2023

Towards Out-of-Distribution Adversarial Robustness

Adam Ibrahim, Charles Guille-Escuret, Ioannis Mitliagkas, Irina Rish,, David Krueger, Pouya Bashivan

PDF

Open Access 1 Repo

TL;DR

This paper proposes a domain generalisation approach using Risk Extrapolation to improve adversarial robustness across multiple attack types, outperforming existing methods especially on unseen attack families.

Contribution

It introduces treating attack types as domains and applying REx to enhance robustness transferability across different adversarial attacks.

Findings

01

Achieves superior worst-case robustness on training attacks.

02

Improves accuracy on unseen attack families at test time.

03

Significantly boosts performance on attack ensembles, e.g., on MNIST and CIFAR10.

Abstract

Adversarial robustness continues to be a major challenge for deep learning. A core issue is that robustness to one type of attack often fails to transfer to other attacks. While prior work establishes a theoretical trade-off in robustness against different $L_{p}$ norms, we show that there is potential for improvement against many commonly used attacks by adopting a domain generalisation approach. Concretely, we treat each type of attack as a domain, and apply the Risk Extrapolation method (REx), which promotes similar levels of robustness against all training attacks. Compared to existing methods, we obtain similar or superior worst-case adversarial robustness on attacks seen during training. Moreover, we achieve superior performance on families or tunings of attacks only encountered at test time. On ensembles of attacks, our approach improves the accuracy from 3.4% with the best…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aiproj/towards-out-of-distribution-adversarial-robustness
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

MethodsTest