Improving Black-box Adversarial Attacks with a Transfer-based Prior
Shuyu Cheng, Yinpeng Dong, Tianyu Pang, Hang Su, Jun Zhu

TL;DR
This paper introduces a transfer-based prior-guided random gradient-free method to enhance black-box adversarial attacks, achieving higher success rates with fewer queries by effectively combining transfer gradients and query feedback.
Contribution
The paper proposes a novel P-RGF method that integrates transfer-based priors with query information, improving attack efficiency and success rates in black-box settings.
Findings
Requires fewer queries for successful attacks.
Achieves higher attack success rates than existing methods.
Effectively combines transfer gradients with query feedback.
Abstract
We consider the black-box adversarial setting, where the adversary has to generate adversarial perturbations without access to the target models to compute gradients. Previous methods tried to approximate the gradient either by using a transfer gradient of a surrogate white-box model, or based on the query feedback. However, these methods often suffer from low attack success rates or poor query efficiency since it is non-trivial to estimate the gradient in a high-dimensional space with limited information. To address these problems, we propose a prior-guided random gradient-free (P-RGF) method to improve black-box adversarial attacks, which takes the advantage of a transfer-based prior and the query information simultaneously. The transfer-based prior given by the gradient of a surrogate model is appropriately integrated into our algorithm by an optimal coefficient derived by a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Security and Verification in Computing
