Deep neural network loses attention to adversarial images
Shashank Kotyan, Danilo Vasconcellos Vargas

TL;DR
This paper investigates how different adversarial attacks affect neural network attention by analyzing saliency and activation maps, revealing diverse mechanisms of attention disruption and implications for robustness.
Contribution
It provides a detailed analysis of how various adversarial attacks distort neural network attention, enhancing understanding of attack mechanisms and defense vulnerabilities.
Findings
Pixel Attack redirects attention to perturbed pixels or away from them
Projected Gradient Descent attack reduces attention in intermediate layers
Different attacks impact saliency and activation maps in distinct ways
Abstract
Adversarial algorithms have shown to be effective against neural networks for a variety of tasks. Some adversarial algorithms perturb all the pixels in the image minimally for the image classification task in image classification. In contrast, some algorithms perturb few pixels strongly. However, very little information is available regarding why these adversarial samples so diverse from each other exist. Recently, Vargas et al. showed that the existence of these adversarial samples might be due to conflicting saliency within the neural network. We test this hypothesis of conflicting saliency by analysing the Saliency Maps (SM) and Gradient-weighted Class Activation Maps (Grad-CAM) of original and few different types of adversarial samples. We also analyse how different adversarial samples distort the attention of the neural network compared to original samples. We show that in the case…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Cell Image Analysis Techniques
