Generating Black-Box Adversarial Examples in Sparse Domain
Hadi Zanddizari, Behnam Zeinali, and J. Morris Chang

TL;DR
This paper introduces a novel black-box adversarial attack method in the sparse domain, leveraging LaS components to efficiently fool multiple image classifiers with fewer queries.
Contribution
It proposes a new approach that perturbs LaS components in the sparse domain, demonstrating effectiveness against various classifiers and providing theoretical insights into perturbation metrics.
Findings
Successfully fools six state-of-the-art classifiers
Uses fewer queries than existing methods
Provides theoretical connection between perturbation metrics and sparse domain
Abstract
Applications of machine learning (ML) models and convolutional neural networks (CNNs) have been rapidly increased. Although state-of-the-art CNNs provide high accuracy in many applications, recent investigations show that such networks are highly vulnerable to adversarial attacks. The black-box adversarial attack is one type of attack that the attacker does not have any knowledge about the model or the training dataset, but it has some input data set and their labels. In this paper, we propose a novel approach to generate a black-box attack in sparse domain whereas the most important information of an image can be observed. Our investigation shows that large sparse (LaS) components play a critical role in the performance of image classifiers. Under this presumption, to generate adversarial example, we transfer an image into a sparse domain and put a threshold to choose only k LaS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
