Defensive Distillation is Not Robust to Adversarial Examples

Nicholas Carlini; David Wagner

arXiv:1607.04311·cs.CR·July 18, 2016·238 cites

Defensive Distillation is Not Robust to Adversarial Examples

Nicholas Carlini, David Wagner

PDF

Open Access

TL;DR

This paper demonstrates that defensive distillation, previously thought to improve neural network robustness, does not actually provide resistance against targeted adversarial attacks.

Contribution

The study reveals that defensive distillation fails to enhance neural network security against adversarial examples, challenging prior assumptions.

Findings

01

Defensive distillation does not improve robustness against targeted attacks.

02

Neural networks with defensive distillation are as vulnerable as unprotected models.

03

The paper provides evidence that defensive distillation is ineffective for security.

Abstract

We show that defensive distillation is not secure: it is no more resistant to targeted misclassification attacks than unprotected neural networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning