Characterizing Adversarial Subspaces Using Local Intrinsic Dimensionality

Xingjun Ma; Bo Li; Yisen Wang; Sarah M. Erfani; Sudanthi Wijewickrema; Grant Schoenebeck; Dawn Song; Michael E. Houle; James Bailey

arXiv:1801.02613·cs.LG·March 15, 2018·154 cites

TL;DR

This paper uses Local Intrinsic Dimensionality to characterize and distinguish adversarial regions in deep neural networks, providing insights that could improve detection and understanding of adversarial attacks.

Contribution

It introduces LID as a novel metric to analyze adversarial subspaces and demonstrates its effectiveness in detecting adversarial examples across multiple attack strategies.

Findings

1. LID can effectively distinguish adversarial examples from normal data.
2. LID outperforms several state-of-the-art detection methods.
3. Analysis suggests new directions for adversarial defense and attack development.

Abstract

Deep Neural Networks (DNNs) have recently been shown to be vulnerable against adversarial examples, which are carefully crafted instances that can mislead DNNs to make errors during prediction. To better understand such attacks, a characterization is needed of the properties of regions (the so-called 'adversarial subspaces') in which adversarial examples lie. We tackle this challenge by characterizing the dimensional properties of adversarial regions, via the use of Local Intrinsic Dimensionality (LID). LID assesses the space-filling capability of the region surrounding a reference example, based on the distance distribution of the example to its neighbors. We first provide explanations about how adversarial perturbation can affect the LID characteristic of adversarial regions, and then show empirically that LID characteristics can facilitate the distinction of adversarial examples…
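To make the idea of estimating LID from "the distance distribution of the example to its neighbors" concrete, here is a minimal sketch in Python. It assumes the standard maximum-likelihood estimator, LID(x) ≈ -( (1/k) Σ log(r_i / r_k) )^(-1), where r_1 ≤ … ≤ r_k are the distances from x to its k nearest neighbors. The function name `lid_mle`, the choice k=20, and the synthetic data are illustrative assumptions, not details taken from this page, and the sketch does not reproduce the paper's detection pipeline.

```python
import numpy as np

def lid_mle(query, reference, k=20):
    """Maximum-likelihood LID estimate for `query` from its k nearest
    neighbors in `reference` (hypothetical helper, for illustration only)."""
    dists = np.linalg.norm(reference - query, axis=1)
    # Drop zero distances (the query itself) and keep the k smallest.
    dists = np.sort(dists[dists > 0])[:k]
    # r_i / r_k <= 1, so the mean log-ratio is negative and the estimate positive.
    return -1.0 / np.mean(np.log(dists / dists[-1]))

rng = np.random.default_rng(0)

# Points on a 2-D plane embedded in a 10-D ambient space.
flat = np.zeros((2000, 10))
flat[:, :2] = rng.uniform(-1, 1, size=(2000, 2))

# Points that fill the full 10-D cube.
full = rng.uniform(-1, 1, size=(2000, 10))

# Average the per-point estimates over the first 100 points of each set.
lid_flat = np.mean([lid_mle(flat[i], flat) for i in range(100)])
lid_full = np.mean([lid_mle(full[i], full) for i in range(100)])

print(f"mean LID estimate, 2-D plane in 10-D: {lid_flat:.2f}")
print(f"mean LID estimate, full 10-D cube:    {lid_full:.2f}")
```

Both point sets live in the same 10-dimensional ambient space, but the estimate tracks how many directions the neighbors actually fill around each point. The abstract's claim is that this kind of local measurement differs between adversarial regions and regions of normal data.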

Code & Models

Repositories

xingjunm/lid_adversarial_subspace_detection (TensorFlow, official)

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Anomaly Detection Techniques and Applications