Understanding the One-Pixel Attack: Propagation Maps and Locality Analysis
Danilo Vasconcellos Vargas, Jiawei Su

TL;DR
This paper investigates why deep neural networks are vulnerable to single pixel attacks by introducing Propagation Maps, revealing how local perturbations propagate through layers and affect receptive fields, aiding in understanding adversarial vulnerabilities.
Contribution
It introduces Propagation Maps to analyze perturbation propagation and demonstrates that vulnerabilities are rooted in receptive fields rather than individual neurons or pixels.
Findings
Single pixel modifications propagate to the last layer even in deep networks.
Perturbations spread and reach near-maximum differences in feature maps.
Vulnerability is linked to receptive fields, not just neurons or pixels.
Abstract
Deep neural networks were shown to be vulnerable to single pixel modifications. However, the reason behind such phenomena has never been elucidated. Here, we propose Propagation Maps which show the influence of the perturbation in each layer of the network. Propagation Maps reveal that even in extremely deep networks such as Resnet, modification in one pixel easily propagates until the last layer. In fact, this initial local perturbation is also shown to spread becoming a global one and reaching absolute difference values that are close to the maximum value of the original feature maps in a given layer. Moreover, we do a locality analysis in which we demonstrate that nearby pixels of the perturbed one in the one-pixel attack tend to share the same vulnerability, revealing that the main vulnerability lies in neither neurons nor pixels but receptive fields. Hopefully, the analysis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Anomaly Detection Techniques and Applications
