On the Limitation of Local Intrinsic Dimensionality for Characterizing   the Subspaces of Adversarial Examples

Pei-Hsuan Lu; Pin-Yu Chen; Chia-Mu Yu

arXiv:1803.09638·cs.LG·April 11, 2018·26 cites

On the Limitation of Local Intrinsic Dimensionality for Characterizing the Subspaces of Adversarial Examples

Pei-Hsuan Lu, Pin-Yu Chen, Chia-Mu Yu

PDF

Open Access 1 Repo

TL;DR

This paper critically examines the effectiveness of local intrinsic dimensionality (LID) in characterizing adversarial subspaces of deep neural networks, revealing significant limitations in various attack scenarios and confidence levels.

Contribution

The study identifies key limitations of LID in analyzing adversarial subspaces, especially for oblivious, confidence-varying, and transfer attack scenarios, extending previous analyses.

Findings

01

LID performance is highly sensitive to attack confidence parameters.

02

LID performs poorly with adversarial examples of varying confidence levels.

03

LID is ineffective in characterizing transfer-based adversarial examples.

Abstract

Understanding and characterizing the subspaces of adversarial examples aid in studying the robustness of deep neural networks (DNNs) to adversarial perturbations. Very recently, Ma et al. (ICLR 2018) proposed to use local intrinsic dimensionality (LID) in layer-wise hidden representations of DNNs to study adversarial subspaces. It was demonstrated that LID can be used to characterize the adversarial subspaces associated with different attack methods, e.g., the Carlini and Wagner's (C&W) attack and the fast gradient sign attack. In this paper, we use MNIST and CIFAR-10 to conduct two new sets of experiments that are absent in existing LID analysis and report the limitation of LID in characterizing the corresponding adversarial subspaces, which are (i) oblivious attacks and LID analysis using adversarial examples with different confidence levels; and (ii) black-box transfer attacks. For…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ysharma1126/EAD_Attack
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Advanced Malware Detection Techniques