Gradient Aligned Attacks via a Few Queries
Xiangyuan Yang, Jie Lin, Hanlin Zhang, Xinyu Yang, Peng Zhao

TL;DR
This paper introduces gradient aligned attacks (GAA) that leverage gradient aligned losses to enhance black-box query attack efficiency and effectiveness, especially under limited query scenarios, by improving gradient estimation accuracy.
Contribution
The paper proposes a novel gradient aligned mechanism and loss functions that significantly improve attack success rates and reduce query counts in black-box attacks with few queries.
Findings
Improved attack success rate by up to 31.3% on ImageNet.
Reduced number of queries needed by up to 2.9 times.
Effective in scenarios with strict query limitations.
Abstract
Black-box query attacks, which rely only on the output of the victim model, have proven to be effective in attacking deep learning models. However, existing black-box query attacks show low performance in a novel scenario where only a few queries are allowed. To address this issue, we propose gradient aligned attacks (GAA), which use the gradient aligned losses (GAL) we designed on the surrogate model to estimate the accurate gradient to improve the attack performance on the victim model. Specifically, we propose a gradient aligned mechanism to ensure that the derivatives of the loss function with respect to the logit vector have the same weight coefficients between the surrogate and victim models. Using this mechanism, we transform the cross-entropy (CE) loss and margin loss into gradient aligned forms, i.e. the gradient aligned CE or margin losses. These losses not only improve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Boron Compounds in Chemistry · Traumatic Brain Injury and Neurovascular Disturbances
