How Wrong Am I? - Studying Adversarial Examples and their Impact on   Uncertainty in Gaussian Process Machine Learning Models

Kathrin Grosse; David Pfaff; Michael Thomas Smith; Michael Backes

arXiv:1711.06598·cs.CR·January 4, 2019·6 cites

How Wrong Am I? - Studying Adversarial Examples and their Impact on Uncertainty in Gaussian Process Machine Learning Models

Kathrin Grosse, David Pfaff, Michael Thomas Smith, Michael Backes

PDF

Open Access

TL;DR

This paper explores how Gaussian Process models can detect adversarial examples through uncertainty measures, showing promising results in identifying attacks across various datasets and models.

Contribution

It introduces a Bayesian inference framework using Gaussian Processes to analyze adversarial examples and demonstrates their effectiveness in detecting attacks via uncertainty thresholds.

Findings

01

Uncertainty levels in Gaussian Processes reflect adversarial perturbations

02

Uncertainty thresholds can reject many adversarial examples

03

Modified attacks can bypass simple uncertainty-based defenses

Abstract

Machine learning models are vulnerable to Adversarial Examples: minor perturbations to input samples intended to deliberately cause misclassification. Current defenses against adversarial examples, especially for Deep Neural Networks (DNN), are primarily derived from empirical developments, and their security guarantees are often only justified retroactively. Many defenses therefore rely on hidden assumptions that are subsequently subverted by increasingly elaborate attacks. This is not surprising: deep learning notoriously lacks a comprehensive mathematical framework to provide meaningful guarantees. In this paper, we leverage Gaussian Processes to investigate adversarial examples in the framework of Bayesian inference. Across different models and datasets, we find deviating levels of uncertainty reflect the perturbation introduced to benign samples by state-of-the-art attacks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Software Testing and Debugging Techniques