Spatially Correlated Patterns in Adversarial Images
Nandish Chattopadhyay, Lionell Yip En Zhi, Bryan Tan Bing Xing and, Anupam Chattopadhyay

TL;DR
This paper investigates spatial patterns in adversarial images, proposing a framework to identify critical regions for classification and attack, which can inform better defense strategies against adversarial perturbations.
Contribution
It introduces a novel framework for segregating and neutralizing vulnerable image regions, enhancing interpretability and defense against adversarial attacks.
Findings
Mapping features into input space preserves significant patterns.
Segregation of regions improves understanding of attack and classification.
Neutralizing vulnerable regions can enhance adversarial defenses.
Abstract
Adversarial attacks have proved to be the major impediment in the progress on research towards reliable machine learning solutions. Carefully crafted perturbations, imperceptible to human vision, can be added to images to force misclassification by an otherwise high performing neural network. To have a better understanding of the key contributors of such structured attacks, we searched for and studied spatially co-located patterns in the distribution of pixels in the input space. In this paper, we propose a framework for segregating and isolating regions within an input image which are particularly critical towards either classification (during inference), or adversarial vulnerability or both. We assert that during inference, the trained model looks at a specific region in the image, which we call Region of Importance (RoI); and the attacker looks at a region to alter/modify, which we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Image Processing Techniques and Applications · Adversarial Robustness in Machine Learning
MethodsInterpretability
