Query-Efficient Hard-label Black-box Attack:An Optimization-based Approach
Minhao Cheng, Thong Le, Pin-Yu Chen, Jinfeng Yi, Huan Zhang, Cho-Jui, Hsieh

TL;DR
This paper introduces a novel optimization-based method for hard-label black-box attacks on machine learning models, reducing query complexity and improving attack efficiency across various models and datasets.
Contribution
It formulates the attack as a continuous optimization problem solvable by zeroth order methods, enabling more efficient and theoretically grounded black-box attacks.
Findings
Outperforms random walk approach on CNNs for MNIST, CIFAR, ImageNet
Effective against discrete models like Gradient Boosting Decision Trees
Provides convergence bounds for the proposed optimization algorithm
Abstract
We study the problem of attacking a machine learning model in the hard-label black-box setting, where no model information is revealed except that the attacker can make queries to probe the corresponding hard-label decisions. This is a very challenging problem since the direct extension of state-of-the-art white-box attacks (e.g., CW or PGD) to the hard-label black-box setting will require minimizing a non-continuous step function, which is combinatorial and cannot be solved by a gradient-based optimizer. The only current approach is based on random walk on the boundary, which requires lots of queries and lacks convergence guarantees. We propose a novel way to formulate the hard-label black-box attack as a real-valued optimization problem which is usually continuous and can be solved by any zeroth order optimization algorithm. For example, using the Randomized Gradient-Free method, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
