Understanding the One-Pixel Attack: Propagation Maps and Locality   Analysis

Danilo Vasconcellos Vargas; Jiawei Su

arXiv:1902.02947·cs.LG·February 11, 2019·25 cites

Understanding the One-Pixel Attack: Propagation Maps and Locality Analysis

Danilo Vasconcellos Vargas, Jiawei Su

PDF

Open Access

TL;DR

This paper investigates why deep neural networks are vulnerable to single pixel attacks by introducing Propagation Maps, revealing how local perturbations propagate through layers and affect receptive fields, aiding in understanding adversarial vulnerabilities.

Contribution

It introduces Propagation Maps to analyze perturbation propagation and demonstrates that vulnerabilities are rooted in receptive fields rather than individual neurons or pixels.

Findings

01

Single pixel modifications propagate to the last layer even in deep networks.

02

Perturbations spread and reach near-maximum differences in feature maps.

03

Vulnerability is linked to receptive fields, not just neurons or pixels.

Abstract

Deep neural networks were shown to be vulnerable to single pixel modifications. However, the reason behind such phenomena has never been elucidated. Here, we propose Propagation Maps which show the influence of the perturbation in each layer of the network. Propagation Maps reveal that even in extremely deep networks such as Resnet, modification in one pixel easily propagates until the last layer. In fact, this initial local perturbation is also shown to spread becoming a global one and reaching absolute difference values that are close to the maximum value of the original feature maps in a given layer. Moreover, we do a locality analysis in which we demonstrate that nearby pixels of the perturbed one in the one-pixel attack tend to share the same vulnerability, revealing that the main vulnerability lies in neither neurons nor pixels but receptive fields. Hopefully, the analysis…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Anomaly Detection Techniques and Applications