HopSkipJumpAttack: A Query-Efficient Decision-Based Attack

Jianbo Chen; Michael I. Jordan; Martin J. Wainwright

arXiv:1904.02144·cs.LG·April 29, 2020·70 cites

HopSkipJumpAttack: A Query-Efficient Decision-Based Attack

Jianbo Chen, Michael I. Jordan, Martin J. Wainwright

PDF

Open Access 3 Repos

TL;DR

HopSkipJumpAttack is a query-efficient decision-based adversarial attack method that estimates gradient directions using binary outputs, requiring fewer queries and effectively bypassing defenses.

Contribution

It introduces a novel gradient estimation technique for decision-based attacks, improving query efficiency and attack success rates against defended models.

Findings

01

Requires fewer queries than Boundary Attack

02

Achieves competitive attack success rates

03

Effective against various defense mechanisms

Abstract

The goal of a decision-based adversarial attack on a trained model is to generate adversarial examples based solely on observing output labels returned by the targeted model. We develop HopSkipJumpAttack, a family of algorithms based on a novel estimate of the gradient direction using binary information at the decision boundary. The proposed family includes both untargeted and targeted attacks optimized for $ℓ_{2}$ and $ℓ_{\infty}$ similarity metrics respectively. Theoretical analysis is provided for the proposed algorithms and the gradient direction estimate. Experiments show HopSkipJumpAttack requires significantly fewer model queries than Boundary Attack. It also achieves competitive performance in attacking several widely-used defense mechanisms. (HopSkipJumpAttack was named Boundary Attack++ in a previous version of the preprint.)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning