Disrupting Semantic and Abstract Features for Better Adversarial Transferability
Yuyang Luo, Xiaosen Wang, Zhijin Ge, Yingzhe He

TL;DR
This paper introduces SAFER, a novel method that disrupts both semantic and abstract features in images by manipulating high-frequency components, significantly enhancing the transferability of adversarial examples across neural networks.
Contribution
SAFER is the first approach to simultaneously target semantic and high-frequency features for improved adversarial transferability.
Findings
SAFER outperforms existing methods in transferability on ImageNet.
Disrupting high-frequency features enhances attack success in black-box settings.
The method effectively combines semantic and abstract feature disruption for stronger adversarial attacks.
Abstract
Adversarial examples pose significant threats to deep neural networks (DNNs), and their property of transferability in the black-box setting has led to the emergence of transfer-based attacks, making it feasible to target real-world applications employing DNNs. Among them, feature-level attacks, where intermediate features are perturbed based on feature importance weight matrix computed from transformed images, have gained popularity. In this work, we find that existing feature-level attacks primarily manipulate the semantic information to derive the weight matrix. Inspired by several works that find CNNs tend to focus more on high-frequency components (a.k.a. abstract features, e.g., texture, edge, etc.), we validate that transforming images in the high-frequency space also improves transferability. Based on this finding, we propose a balanced approach called Semantic and Abstract…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Digital and Cyber Forensics
