GNP Attack: Transferable Adversarial Examples via Gradient Norm Penalty

Tao Wu; Tie Luo; Donald C. Wunsch

arXiv:2307.04099·cs.LG·July 11, 2023

GNP Attack: Transferable Adversarial Examples via Gradient Norm Penalty

Tao Wu, Tie Luo, Donald C. Wunsch

PDF

Open Access

TL;DR

This paper introduces a Gradient Norm Penalty method to improve the transferability of adversarial examples, making black-box attacks more effective across diverse models and defenses.

Contribution

It proposes a novel Gradient Norm Penalty approach that enhances adversarial example transferability by encouraging optimization to find flatter local optima.

Findings

01

GNP significantly improves transferability across multiple models.

02

GNP can be integrated with other gradient-based attack methods.

03

The method is effective against advanced defense mechanisms.

Abstract

Adversarial examples (AE) with good transferability enable practical black-box attacks on diverse target models, where insider knowledge about the target models is not required. Previous methods often generate AE with no or very limited transferability; that is, they easily overfit to the particular architecture and feature representation of the source, white-box model and the generated AE barely work for target, black-box models. In this paper, we propose a novel approach to enhance AE transferability using Gradient Norm Penalty (GNP). It drives the loss function optimization procedure to converge to a flat region of local optima in the loss landscape. By attacking 11 state-of-the-art (SOTA) deep learning models and 6 advanced defense methods, we empirically show that GNP is very effective in generating AE with high transferability. We also demonstrate that it is very flexible in that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

MethodsAutoencoders