A New Defense Against Adversarial Images: Turning a Weakness into a Strength
Tao Yu, Shengyuan Hu, Chuan Guo, Wei-Lun Chao, Kilian Q. Weinberger

TL;DR
This paper proposes a novel detection method for adversarial images by exploiting the density of adversarial directions, turning the vulnerability into a detection strength with high accuracy even in white-box scenarios.
Contribution
It introduces a new perspective that uses adversarial perturbation density as a signature for detection, achieving high accuracy in white-box settings.
Findings
High detection accuracy under white-box attack conditions
Adversarial directions are harder to find or denser in tampered images
The method outperforms existing detection techniques
Abstract
Natural images are virtually surrounded by low-density misclassified regions that can be efficiently discovered by gradient-guided search --- enabling the generation of adversarial images. While many techniques for detecting these attacks have been proposed, they are easily bypassed when the adversary has full knowledge of the detection mechanism and adapts the attack strategy accordingly. In this paper, we adopt a novel perspective and regard the omnipresence of adversarial perturbations as a strength rather than a weakness. We postulate that if an image has been tampered with, these adversarial directions either become harder to find with gradient methods or have substantially higher density than for natural images. We develop a practical test for this signature characteristic to successfully detect adversarial attacks, achieving unprecedented accuracy under the white-box setting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Bacillus and Francisella bacterial research
MethodsTest
