Detecting Adversarial Examples in Batches -- a geometrical approach
Danush Kumar Venkatesh, Peter Steinbach

TL;DR
This paper introduces two geometric metrics, density and coverage, to detect adversarial examples in batches, demonstrating promising results on MNIST and biomedical datasets, and aims to enhance robustness monitoring in deployed machine learning systems.
Contribution
The paper adapts and evaluates geometric metrics for adversarial detection, providing a novel approach to monitor model robustness against unseen adversarial attacks.
Findings
Both metrics show promising detection capabilities.
Effective on MNIST and biomedical datasets.
Potential for deployment in real-world systems.
Abstract
Many deep learning methods have successfully solved complex tasks in computer vision and speech recognition applications. Nonetheless, the robustness of these models has been found to be vulnerable to perturbed inputs or adversarial examples, which are imperceptible to the human eye, but lead the model to erroneous output decisions. In this study, we adapt and introduce two geometric metrics, density and coverage, and evaluate their use in detecting adversarial samples in batches of unseen data. We empirically study these metrics using MNIST and two real-world biomedical datasets from MedMNIST, subjected to two different adversarial attacks. Our experiments show promising results for both metrics to detect adversarial examples. We believe that his work can lay the ground for further study on these metrics' use in deployed machine learning systems to monitor for possible attacks by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · COVID-19 diagnosis using AI
