On the Limitation of Local Intrinsic Dimensionality for Characterizing the Subspaces of Adversarial Examples
Pei-Hsuan Lu, Pin-Yu Chen, Chia-Mu Yu

TL;DR
This paper critically examines the effectiveness of local intrinsic dimensionality (LID) in characterizing adversarial subspaces of deep neural networks, revealing significant limitations in various attack scenarios and confidence levels.
Contribution
The study identifies key limitations of LID in analyzing adversarial subspaces, especially for oblivious, confidence-varying, and transfer attack scenarios, extending previous analyses.
Findings
LID performance is highly sensitive to attack confidence parameters.
LID performs poorly with adversarial examples of varying confidence levels.
LID is ineffective in characterizing transfer-based adversarial examples.
Abstract
Understanding and characterizing the subspaces of adversarial examples aid in studying the robustness of deep neural networks (DNNs) to adversarial perturbations. Very recently, Ma et al. (ICLR 2018) proposed to use local intrinsic dimensionality (LID) in layer-wise hidden representations of DNNs to study adversarial subspaces. It was demonstrated that LID can be used to characterize the adversarial subspaces associated with different attack methods, e.g., the Carlini and Wagner's (C&W) attack and the fast gradient sign attack. In this paper, we use MNIST and CIFAR-10 to conduct two new sets of experiments that are absent in existing LID analysis and report the limitation of LID in characterizing the corresponding adversarial subspaces, which are (i) oblivious attacks and LID analysis using adversarial examples with different confidence levels; and (ii) black-box transfer attacks. For…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Advanced Malware Detection Techniques
