TL;DR
This paper introduces a convex programming framework for generating adversarial examples in deep learning, capable of explaining existing methods and creating new algorithms with competitive fooling ratios.
Contribution
It presents a novel convex programming approach for adversarial example generation that unifies and explains existing methods while enabling the design of new algorithms.
Findings
Framework can recover variants of existing adversarial methods.
Proposed algorithms achieve competitive fooling ratios.
Framework allows for adversarial noise design under various constraints.
Abstract
It has been observed that deep learning architectures tend to make erroneous decisions with high reliability for particularly designed adversarial instances. In this work, we show that the perturbation analysis of these architectures provides a framework for generating adversarial instances by convex programming which, for classification tasks, is able to recover variants of existing non-adaptive adversarial methods. The proposed framework can be used for the design of adversarial noise under various desirable constraints and different types of networks. Moreover, this framework is capable of explaining various existing adversarial methods and can be used to derive new algorithms as well. We make use of these results to obtain novel algorithms. The experiments show the competitive performance of the obtained solutions, in terms of fooling ratio, when benchmarked with well-known…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
