Can Perceptual Guidance Lead to Semantically Explainable Adversarial Perturbations?
P Charantej Reddy, Aditya Siripuram, Sumohana S. Channappayya

TL;DR
This paper introduces a perceptually guided adversarial perturbation framework that uses SSIM-based constraints to generate semantically explainable adversarial examples, revealing that such perturbations target important classification regions.
Contribution
The authors propose a novel SSIM-based convex optimization method for generating perceptually constrained adversarial perturbations that are semantically explainable.
Findings
Perturbations focus on regions important for classification.
Perceptually guided perturbations align with class activation maps.
The method demonstrates improved interpretability of adversarial examples.
Abstract
It is well known that carefully crafted imperceptible perturbations can cause state-of-the-art deep learning classification models to misclassify. Understanding and analyzing these adversarial perturbations play a crucial role in the design of robust convolutional neural networks. However, their mechanics are not well understood. In this work, we attempt to understand the mechanics by systematically answering the following question: do imperceptible adversarial perturbations focus on changing the regions of the image that are important for classification? In other words, are imperceptible adversarial perturbations semantically explainable? Most current methods use distance to generate and characterize the imperceptibility of the adversarial perturbations. However, since distances only measure the pixel to pixel distances and do not consider the structure in the image, these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)
MethodsDepthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · Convolution · Batch Normalization · Average Pooling · Inverted Residual Block · 1x1 Convolution
