On the Role of Generalization in Transferability of Adversarial Examples
Yilin Wang, Farzan Farnia

TL;DR
This paper investigates how the generalization properties of substitute neural networks influence the transferability of black-box adversarial examples, providing theoretical bounds and empirical evidence for improved attack success rates.
Contribution
It introduces a theoretical framework linking substitute network generalization to transferability and demonstrates the effectiveness of Lipschitz regularization in enhancing attack transferability.
Findings
Better generalization of substitute networks leads to more transferable adversarial examples.
Lipschitz regularization improves the transferability of adversarial attacks.
Theoretical bounds relate transferability to generalization performance.
Abstract
Black-box adversarial attacks designing adversarial examples for unseen neural networks (NNs) have received great attention over the past years. While several successful black-box attack schemes have been proposed in the literature, the underlying factors driving the transferability of black-box adversarial examples still lack a thorough understanding. In this paper, we aim to demonstrate the role of the generalization properties of the substitute classifier used for generating adversarial examples in the transferability of the attack scheme to unobserved NN classifiers. To do this, we apply the max-min adversarial example game framework and show the importance of the generalization properties of the substitute NN in the success of the black-box attack scheme in application to different NN classifiers. We prove theoretical generalization bounds on the difference between the attack…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning in Materials Science · Fault Detection and Control Systems
MethodsTest
