Hard-label based Small Query Black-box Adversarial Attack

Jeonghwan Park; Paul Miller; Niall McLaughlin

arXiv:2403.06014·cs.LG·March 12, 2024·1 cites

Hard-label based Small Query Black-box Adversarial Attack

Jeonghwan Park, Paul Miller, Niall McLaughlin

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a new hard-label black-box adversarial attack method that leverages a pretrained surrogate model to significantly improve query efficiency and success rates, especially with limited queries.

Contribution

It proposes a novel hard-label attack approach guided by a surrogate model, enhancing query efficiency over existing methods.

Findings

01

Achieves approximately 5 times higher attack success rate than benchmarks.

02

Significantly improves query efficiency across various model architectures.

03

Effective at small query budgets of 100 and 250.

Abstract

We consider the hard label based black box adversarial attack setting which solely observes predicted classes from the target model. Most of the attack methods in this setting suffer from impractical number of queries required to achieve a successful attack. One approach to tackle this drawback is utilising the adversarial transferability between white box surrogate models and black box target model. However, the majority of the methods adopting this approach are soft label based to take the full advantage of zeroth order optimisation. Unlike mainstream methods, we propose a new practical setting of hard label based attack with an optimisation process guided by a pretrained surrogate model. Experiments show the proposed method significantly improves the query efficiency of the hard label based black-box attack across various target model architectures. We find the proposed method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jpark04-qub/sqba
pytorchOfficial

Videos

Hard-Label Based Small Query Black-Box Adversarial Attack· youtube

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Physical Unclonable Functions (PUFs) and Hardware Security