Generating Adversarial Inputs Using A Black-box Differential Technique
Jo\~ao Batista Pereira Matos Ju\'unior, Lucas Carvalho Cordeiro,, Marcelo d'Amorim, Xiaowei Huang

TL;DR
This paper introduces DAEGEN, a novel black-box differential technique for generating adversarial inputs that highlight differences between two neural network models, demonstrating superior effectiveness, precision, and efficiency over existing methods.
Contribution
DAEGEN is the first black-box method to generate difference-inducing adversarial examples by optimizing input perturbations to maximize model prediction differences.
Findings
DAEGEN successfully generates adversarial examples across multiple datasets and models.
It outperforms existing white-box differential techniques in effectiveness, precision, and efficiency.
It surpasses state-of-the-art black-box adversarial attack methods in differential settings.
Abstract
Neural Networks (NNs) are known to be vulnerable to adversarial attacks. A malicious agent initiates these attacks by perturbing an input into another one such that the two inputs are classified differently by the NN. In this paper, we consider a special class of adversarial examples, which can exhibit not only the weakness of NN models - as do for the typical adversarial examples - but also the different behavior between two NN models. We call them difference-inducing adversarial examples or DIAEs. Specifically, we propose DAEGEN, the first black-box differential technique for adversarial input generation. DAEGEN takes as input two NN models of the same classification problem and reports on output an adversarial example. The obtained adversarial example is a DIAE, so that it represents a point-wise difference in the input space between the two NN models. Algorithmically, DAEGEN uses a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning in Materials Science
Methods1x1 Convolution · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · Bottleneck Residual Block · Max Pooling · Average Pooling · Convolution · Residual Connection · Global Average Pooling · Residual Block
