Using Anomaly Feature Vectors for Detecting, Classifying and Warning of Outlier Adversarial Examples
Nelson Manohar-Alers, Ryan Feng, Sahib Singh, Jiguo Song, Atul Prakash

TL;DR
DeClaW is a system that detects, classifies, and warns about adversarial inputs to neural networks by analyzing anomaly feature vectors, achieving high accuracy in attack type identification on CIFAR-10.
Contribution
The paper introduces anomaly feature vectors for identifying and classifying different adversarial attack types, advancing beyond simple detection methods.
Findings
AFVs distinguish attack types with 93% accuracy on CIFAR-10.
AFV-based analysis can inform attack-specific mitigation strategies.
Preliminary results show promise for broader application in adversarial defense.
Abstract
We present DeClaW, a system for detecting, classifying, and warning of adversarial inputs presented to a classification neural network. In contrast to current state-of-the-art methods that, given an input, detect whether an input is clean or adversarial, we aim to also identify the types of adversarial attack (e.g., PGD, Carlini-Wagner or clean). To achieve this, we extract statistical profiles, which we term as anomaly feature vectors, from a set of latent features. Preliminary findings suggest that AFVs can help distinguish among several types of adversarial attacks (e.g., PGD versus Carlini-Wagner) with close to 93% accuracy on the CIFAR-10 dataset. The results open the door to using AFV-based methods for exploring not only adversarial attack detection but also classification of the attack type and then design of attack-specific mitigation strategies.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications
