Copy and Paste: A Simple But Effective Initialization Method for Black-Box Adversarial Attacks
Thomas Brunner, Frederik Diehl, Alois Knoll

TL;DR
This paper introduces a simple initialization method for black-box adversarial attacks that significantly reduces query counts by copying small patches from other images, improving attack efficiency.
Contribution
It reveals the importance of initialization in black-box attacks and proposes a straightforward patch-copying strategy that enhances query efficiency.
Findings
Reduces queries for Boundary Attack by 81%.
Outperforms previous targeted attack results on ImageNet.
Highlights the impact of initialization on attack success.
Abstract
Many optimization methods for generating black-box adversarial examples have been proposed, but the aspect of initializing said optimizers has not been considered in much detail. We show that the choice of starting points is indeed crucial, and that the performance of state-of-the-art attacks depends on it. First, we discuss desirable properties of starting points for attacking image classifiers, and how they can be chosen to increase query efficiency. Notably, we find that simply copying small patches from other images is a valid strategy. We then present an evaluation on ImageNet that clearly demonstrates the effectiveness of this method: Our initialization scheme reduces the number of queries required for a state-of-the-art Boundary Attack by 81%, significantly outperforming previous results reported for targeted black-box adversarial examples.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications
