Hard-label based Small Query Black-box Adversarial Attack
Jeonghwan Park, Paul Miller, Niall McLaughlin

TL;DR
This paper introduces a new hard-label black-box adversarial attack method that leverages a pretrained surrogate model to significantly improve query efficiency and success rates, especially with limited queries.
Contribution
It proposes a novel hard-label attack approach guided by a surrogate model, enhancing query efficiency over existing methods.
Findings
Achieves approximately 5 times higher attack success rate than benchmarks.
Significantly improves query efficiency across various model architectures.
Effective at small query budgets of 100 and 250.
Abstract
We consider the hard label based black box adversarial attack setting which solely observes predicted classes from the target model. Most of the attack methods in this setting suffer from impractical number of queries required to achieve a successful attack. One approach to tackle this drawback is utilising the adversarial transferability between white box surrogate models and black box target model. However, the majority of the methods adopting this approach are soft label based to take the full advantage of zeroth order optimisation. Unlike mainstream methods, we propose a new practical setting of hard label based attack with an optimisation process guided by a pretrained surrogate model. Experiments show the proposed method significantly improves the query efficiency of the hard label based black-box attack across various target model architectures. We find the proposed method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Hard-Label Based Small Query Black-Box Adversarial Attack· youtube
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Physical Unclonable Functions (PUFs) and Hardware Security
