An Empirical Study of Derivative-Free-Optimization Algorithms for Targeted Black-Box Attacks in Deep Neural Networks
Giuseppe Ughi, Vinayak Abrol, Jared Tanner

TL;DR
This study compares various derivative-free optimization algorithms, including a new BOBYQA-based method, for generating targeted black-box adversarial attacks on deep neural networks under query and perturbation constraints.
Contribution
Introduces a new BOBYQA-based DFO algorithm and provides a comprehensive empirical comparison of four state-of-the-art DFO methods for black-box adversarial attacks.
Findings
Vertex-limited algorithms perform well without defenses
BOBYQA-based algorithm excels at small perturbations
Algorithm effectiveness varies with attack settings
Abstract
We perform a comprehensive study on the performance of derivative free optimization (DFO) algorithms for the generation of targeted black-box adversarial attacks on Deep Neural Network (DNN) classifiers assuming the perturbation energy is bounded by an constraint and the number of queries to the network is limited. This paper considers four pre-existing state-of-the-art DFO-based algorithms along with the introduction of a new algorithm built on BOBYQA, a model-based DFO method. We compare these algorithms in a variety of settings according to the fraction of images that they successfully misclassify given a maximum number of queries to the DNN. The experiments disclose how the likelihood of finding an adversarial example depends on both the algorithm used and the setting of the attack; algorithms limiting the search of adversarial example to the vertices of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
