GradID: Adversarial Detection via Intrinsic Dimensionality of Gradients
Mohammad Mahdi Razmjoo, Mohammad Mahdi Sharifian, Saeed Bagheri Shouraki

TL;DR
This paper introduces a novel adversarial detection method based on analyzing the intrinsic dimensionality of model gradients, effectively distinguishing natural from adversarial data across various datasets and attack types.
Contribution
The paper proposes a new geometric approach using intrinsic dimensionality of gradients for robust adversarial detection, outperforming existing methods on multiple benchmarks.
Findings
High detection accuracy (>92%) on CIFAR-10 against various attacks.
Effective in both batch-wise and individual-sample detection scenarios.
Robust across datasets like MNIST, SVHN, CIFAR-10, and MS COCO.
Abstract
Despite their remarkable performance, deep neural networks exhibit a critical vulnerability: small, often imperceptible, adversarial perturbations can lead to drastically altered model predictions. Given the stringent reliability demands of applications such as medical diagnosis and autonomous driving, robust detection of such adversarial attacks is paramount. In this paper, we investigate the geometric properties of a model's input loss landscape. We analyze the Intrinsic Dimensionality (ID) of the model's gradient parameters, which quantifies the minimal number of coordinates required to describe the data points on their underlying manifold. We reveal a distinct and consistent difference in the ID for natural and adversarial data, which forms the basis of our proposed detection method. We validate our approach across two distinct operational scenarios. First, in a batch-wise context…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Ethics and Social Impacts of AI
