Understanding Measures of Uncertainty for Adversarial Example Detection

Lewis Smith; Yarin Gal

arXiv:1803.08533·stat.ML·March 26, 2018·184 cites

Understanding Measures of Uncertainty for Adversarial Example Detection

Lewis Smith, Yarin Gal

PDF

Open Access 1 Repo

TL;DR

This paper analyzes various uncertainty measures for detecting adversarial examples, highlighting mutual information's effectiveness and proposing probabilistic ensembles to improve uncertainty estimation.

Contribution

It provides a comparative study of uncertainty measures, explains why mutual information works well, and introduces probabilistic ensembles to enhance adversarial detection.

Findings

01

Mutual information effectively detects adversarial examples.

02

MC dropout has notable failure modes.

03

Probabilistic ensembles improve uncertainty estimates.

Abstract

Measuring uncertainty is a promising technique for detecting adversarial examples, crafted inputs on which the model predicts an incorrect class with high confidence. But many measures of uncertainty exist, including predictive en- tropy and mutual information, each capturing different types of uncertainty. We study these measures, and shed light on why mutual information seems to be effective at the task of adversarial example detection. We highlight failure modes for MC dropout, a widely used approach for estimating uncertainty in deep models. This leads to an improved understanding of the drawbacks of current methods, and a proposal to improve the quality of uncertainty estimates using probabilistic model ensembles. We give illustrative experiments using MNIST to demonstrate the intuition underlying the different measures of uncertainty, as well as experiments on a real world Kaggle…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lsgos/uncertainty-adversarial-paper
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Malware Detection Techniques