Improving the Robustness of Adversarial Attacks Using an Affine-Invariant Gradient Estimator
Wenzhao Xiang, Hang Su, Chang Liu, Yandong Guo, Shibao Zheng

TL;DR
This paper introduces an affine-invariant gradient estimator to enhance the robustness and transferability of adversarial attacks against neural networks, especially under affine transformations, improving attack effectiveness in practical scenarios.
Contribution
The paper proposes a novel affine-invariant gradient estimator that can be integrated with existing attack methods to generate more robust adversarial examples under affine transformations.
Findings
Significantly improves affine invariance of adversarial examples
Enhances transferability of adversarial attacks
Effective under physical conditions
Abstract
As designers of artificial intelligence try to outwit hackers, both sides continue to hone in on AI's inherent vulnerabilities. Designed and trained from certain statistical distributions of data, AI's deep neural networks (DNNs) remain vulnerable to deceptive inputs that violate a DNN's statistical, predictive assumptions. Before being fed into a neural network, however, most existing adversarial examples cannot maintain malicious functionality when applied to an affine transformation. For practical purposes, maintaining that malicious functionality serves as an important measure of the robustness of adversarial attacks. To help DNNs learn to defend themselves more thoroughly against attacks, we propose an affine-invariant adversarial attack, which can consistently produce more robust adversarial examples over affine transformations. For efficiency, we propose to disentangle current…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
