Query-Efficient Hard-label Black-box Attack:An Optimization-based   Approach

Minhao Cheng; Thong Le; Pin-Yu Chen; Jinfeng Yi; Huan Zhang; Cho-Jui; Hsieh

arXiv:1807.04457·cs.LG·July 13, 2018·104 cites

Query-Efficient Hard-label Black-box Attack:An Optimization-based Approach

Minhao Cheng, Thong Le, Pin-Yu Chen, Jinfeng Yi, Huan Zhang, Cho-Jui, Hsieh

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel optimization-based method for hard-label black-box attacks on machine learning models, reducing query complexity and improving attack efficiency across various models and datasets.

Contribution

It formulates the attack as a continuous optimization problem solvable by zeroth order methods, enabling more efficient and theoretically grounded black-box attacks.

Findings

01

Outperforms random walk approach on CNNs for MNIST, CIFAR, ImageNet

02

Effective against discrete models like Gradient Boosting Decision Trees

03

Provides convergence bounds for the proposed optimization algorithm

Abstract

We study the problem of attacking a machine learning model in the hard-label black-box setting, where no model information is revealed except that the attacker can make queries to probe the corresponding hard-label decisions. This is a very challenging problem since the direct extension of state-of-the-art white-box attacks (e.g., CW or PGD) to the hard-label black-box setting will require minimizing a non-continuous step function, which is combinatorial and cannot be solved by a gradient-based optimizer. The only current approach is based on random walk on the boundary, which requires lots of queries and lacks convergence guarantees. We propose a novel way to formulate the hard-label black-box attack as a real-valued optimization problem which is usually continuous and can be solved by any zeroth order optimization algorithm. For example, using the Randomized Gradient-Free method, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cmhcbb/attackbox
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning