Attack Agnostic Detection of Adversarial Examples via Random Subspace Analysis
Nathan Drenkow, Neil Fendley, Philippe Burlina

TL;DR
This paper introduces a novel attack-agnostic method for detecting adversarial examples using random subspace analysis, which outperforms existing strategies and requires less calibration data.
Contribution
The paper proposes a new detection technique based on random projections that is attack-agnostic and effective across diverse adversarial strategies.
Findings
Achieves high AUC scores between 0.92 and 0.98
Outperforms competing detection methods significantly
Requires less calibration data than existing approaches
Abstract
Whilst adversarial attack detection has received considerable attention, it remains a fundamentally challenging problem from two perspectives. First, while threat models can be well-defined, attacker strategies may still vary widely within those constraints. Therefore, detection should be considered as an open-set problem, standing in contrast to most current detection approaches. These methods take a closed-set view and train binary detectors, thus biasing detection toward attacks seen during detector training. Second, limited information is available at test time and typically confounded by nuisance factors including the label and underlying content of the image. We address these challenges via a novel strategy based on random subspace analysis. We present a technique that utilizes properties of random projections to characterize the behavior of clean and adversarial examples across a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Attack Agnostic Detection of Adversarial Examples via Random Subspace Analysis· youtube
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Forensic and Genetic Research
