Learning Black-Box Attackers with Transferable Priors and Query Feedback

Jiancheng Yang; Yangzhou Jiang; Xiaoyang Huang; Bingbing Ni; Chenglong; Zhao

arXiv:2010.11742·cs.CR·October 23, 2020·42 cites

Learning Black-Box Attackers with Transferable Priors and Query Feedback

Jiancheng Yang, Yangzhou Jiang, Xiaoyang Huang, Bingbing Ni, Chenglong, Zhao

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces LeBA, a novel black-box attack method combining transferability and query feedback, significantly reducing queries and maintaining high success rates in attacking vision models, including defenses.

Contribution

It proposes a simple yet effective baseline (SimBA++) and a novel learning scheme (HOGA) to improve black-box attack transferability and query efficiency, surpassing prior methods.

Findings

01

LeBA achieves near 100% success rate on ImageNet.

02

LeBA significantly reduces the number of queries needed.

03

The methods outperform state-of-the-art black-box attacks.

Abstract

This paper addresses the challenging black-box adversarial attack problem, where only classification confidence of a victim model is available. Inspired by consistency of visual saliency between different vision models, a surrogate model is expected to improve the attack performance via transferability. By combining transferability-based and query-based black-box attack, we propose a surprisingly simple baseline approach (named SimBA++) using the surrogate model, which significantly outperforms several state-of-the-art methods. Moreover, to efficiently utilize the query feedback, we update the surrogate model in a novel learning scheme, named High-Order Gradient Approximation (HOGA). By constructing a high-order gradient computation graph, we update the surrogate model to approximate the victim model in both forward and backward pass. The SimBA++ and HOGA result in Learnable Black-Box…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TrustworthyDL/LeBA
pytorchOfficial

Videos

Learning Black-Box Attackers with Transferable Priors and Query Feedback· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Network Security and Intrusion Detection