Learning Black-Box Attackers with Transferable Priors and Query Feedback
Jiancheng Yang, Yangzhou Jiang, Xiaoyang Huang, Bingbing Ni, Chenglong, Zhao

TL;DR
This paper introduces LeBA, a novel black-box attack method combining transferability and query feedback, significantly reducing queries and maintaining high success rates in attacking vision models, including defenses.
Contribution
It proposes a simple yet effective baseline (SimBA++) and a novel learning scheme (HOGA) to improve black-box attack transferability and query efficiency, surpassing prior methods.
Findings
LeBA achieves near 100% success rate on ImageNet.
LeBA significantly reduces the number of queries needed.
The methods outperform state-of-the-art black-box attacks.
Abstract
This paper addresses the challenging black-box adversarial attack problem, where only classification confidence of a victim model is available. Inspired by consistency of visual saliency between different vision models, a surrogate model is expected to improve the attack performance via transferability. By combining transferability-based and query-based black-box attack, we propose a surprisingly simple baseline approach (named SimBA++) using the surrogate model, which significantly outperforms several state-of-the-art methods. Moreover, to efficiently utilize the query feedback, we update the surrogate model in a novel learning scheme, named High-Order Gradient Approximation (HOGA). By constructing a high-order gradient computation graph, we update the surrogate model to approximate the victim model in both forward and backward pass. The SimBA++ and HOGA result in Learnable Black-Box…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Network Security and Intrusion Detection
