Benford's law: what does it say on adversarial images?
Jo\~ao G. Zago, Fabio L. Baldissera, Eric A. Antonelo, Rodrigo T., Saad

TL;DR
This paper explores how Benford's law can be used to detect adversarial images by analyzing the distribution of leading digits in pixel values, revealing deviations caused by attacks.
Contribution
It introduces a novel statistical approach using Benford's law to identify adversarial images without altering the CNN or relying on raw pixel features.
Findings
Adversarial images deviate from Benford's law in their pixel leading digit distribution.
Stronger attacks cause greater deviation from Benford's law.
The method offers a potential new defense mechanism against adversarial attacks.
Abstract
Convolutional neural networks (CNNs) are fragile to small perturbations in the input images. These networks are thus prone to malicious attacks that perturb the inputs to force a misclassification. Such slightly manipulated images aimed at deceiving the classifier are known as adversarial images. In this work, we investigate statistical differences between natural images and adversarial ones. More precisely, we show that employing a proper image transformation and for a class of adversarial attacks, the distribution of the leading digit of the pixels in adversarial images deviates from Benford's law. The stronger the attack, the more distant the resulting distribution is from Benford's law. Our analysis provides a detailed investigation of this new approach that can serve as a basis for alternative adversarial example detection methods that do not need to modify the original CNN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Benford’s Law and Fraud Detection · Digital Media Forensic Detection
