Feature Space Targeted Attacks by Statistic Alignment
Lianli Gao, Yaya Cheng, Qilong Zhang, Xing Xu, Jingkuan Song

TL;DR
This paper introduces novel feature space targeted attack methods using statistic alignment, which improve attack transferability and effectiveness by addressing limitations of pixel-wise discrepancy measures.
Contribution
It proposes two new attack approaches based on high-order statistic alignment, enhancing transferability and success rate over existing methods.
Findings
Outperforms state-of-the-art attack algorithms significantly.
Effective across various neural network layers and difficulty levels.
Code is publicly available for reproducibility.
Abstract
By adding human-imperceptible perturbations to images, DNNs can be easily fooled. As one of the mainstream methods, feature space targeted attacks perturb images by modulating their intermediate feature maps, for the discrepancy between the intermediate source and target features is minimized. However, the current choice of pixel-wise Euclidean Distance to measure the discrepancy is questionable because it unreasonably imposes a spatial-consistency constraint on the source and target features. Intuitively, an image can be categorized as "cat" no matter the cat is on the left or right of the image. To address this issue, we propose to measure this discrepancy using statistic alignment. Specifically, we design two novel approaches called Pair-wise Alignment Attack and Global-wise Alignment Attack, which attempt to measure similarities between feature maps by high-order statistics with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
