Delving into the pixels of adversarial samples
Blerta Lindqvist

TL;DR
This paper investigates how adversarial attacks alter image pixels across different architectures, revealing architecture-dependent effects and the influence of pre-processing, leading to new detection methods for strong attacks.
Contribution
It provides a detailed pixel-level analysis of adversarial attacks on multiple architectures, highlighting the role of pre-processing and proposing new detection techniques.
Findings
Attack effects vary by classifier architecture
Input pre-processing influences attack impact
New detection methods for strong attacks
Abstract
Despite extensive research into adversarial attacks, we do not know how adversarial attacks affect image pixels. Knowing how image pixels are affected by adversarial attacks has the potential to lead us to better adversarial defenses. Motivated by instances that we find where strong attacks do not transfer, we delve into adversarial examples at pixel level to scrutinize how adversarial attacks affect image pixel values. We consider several ImageNet architectures, InceptionV3, VGG19 and ResNet50, as well as several strong attacks. We find that attacks can have different effects at pixel level depending on classifier architecture. In particular, input pre-processing plays a previously overlooked role in the effect that attacks have on pixels. Based on the insights of pixel-level examination, we find new ways to detect some of the strongest current attacks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Anomaly Detection Techniques and Applications
