TL;DR
This paper investigates the transferability of adversarial attacks from a data distribution perspective and proposes a novel method that manipulates image distributions to significantly enhance attack transferability across multiple DNNs.
Contribution
It introduces a new approach that improves adversarial transferability by manipulating data distributions, achieving state-of-the-art results in both targeted and untargeted attacks.
Findings
Significantly improves transferability of adversarial examples
Outperforms previous methods by up to 40% in some cases
Effective against multiple DNN architectures
Abstract
Transferable adversarial attacks against Deep neural networks (DNNs) have received broad attention in recent years. An adversarial example can be crafted by a surrogate model and then attack the unknown target model successfully, which brings a severe threat to DNNs. The exact underlying reasons for the transferability are still not completely understood. Previous work mostly explores the causes from the model perspective, e.g., decision boundary, model architecture, and model capacity. adversarial attacks against Deep neural networks (DNNs) have received broad attention in recent years. An adversarial example can be crafted by a surrogate model and then attack the unknown target model successfully, which brings a severe threat to DNNs. The exact underlying reasons for the transferability are still not completely understood. Previous work mostly explores the causes from the model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
