The Limitations of Deep Learning in Adversarial Settings

Nicolas Papernot; Patrick McDaniel; Somesh Jha; Matt Fredrikson; Z. Berkay Celik; Ananthram Swami

arXiv:1511.07528·cs.CR·November 25, 2015·72 cites

TL;DR

This paper formalizes the vulnerability of deep neural networks to adversarial samples, introduces algorithms to craft such samples with high success rates, and explores potential defenses, highlighting significant security concerns in deep learning applications.

Contribution

The paper introduces a novel formal framework for understanding adversaries against DNNs and develops algorithms to generate highly effective adversarial samples.

Findings

01. A 97% adversarial success rate in causing a DNN to misclassify samples into specific target classes

02. An average of 4.02% of input features modified per adversarial sample

03. Vulnerability to the attack varies across sample classes

Abstract

Deep learning takes advantage of large datasets and computationally efficient training algorithms to outperform other approaches at various machine learning tasks. However, imperfections in the training phase of deep neural networks make them vulnerable to adversarial samples: inputs crafted by adversaries with the intent of causing deep neural networks to misclassify. In this work, we formalize the space of adversaries against deep neural networks (DNNs) and introduce a novel class of algorithms to craft adversarial samples based on a precise understanding of the mapping between inputs and outputs of DNNs. In an application to computer vision, we show that our algorithms can reliably produce samples correctly classified by human subjects but misclassified in specific targets by a DNN with a 97% adversarial success rate while only modifying on average 4.02% of the input features per…
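
The idea of crafting samples from "the mapping between inputs and outputs" can be illustrated with a small saliency-map sketch. This is a minimal sketch under stated assumptions, not the authors' implementation: the model is a toy softmax-linear classifier standing in for a DNN, the function names (`jacobian`, `saliency_map`, `craft`) and the parameters `max_fraction` and `theta` are hypothetical, and only one feature is perturbed per step. The saliency rule keeps a feature only if increasing it raises the target-class output while lowering the combined output of the other classes.

```python
import numpy as np

def softmax(z):
    z = z - z.max()              # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def jacobian(W, b, x):
    """Forward derivative dF_j/dx_i of the toy model F(x) = softmax(W x + b).

    Returns an array of shape (n_classes, n_features).
    """
    p = softmax(W @ x + b)
    return p[:, None] * (W - p @ W)

def saliency_map(J, target):
    """Score each feature by how much increasing it favors the target class."""
    d_target = J[target]
    d_others = J.sum(axis=0) - d_target
    s = d_target * np.abs(d_others)
    # Reject features that lower the target output or raise the other outputs.
    s[(d_target < 0) | (d_others > 0)] = 0.0
    return s

def craft(W, b, x, target, max_fraction=0.1, theta=1.0):
    """Greedily increase salient features until the model predicts `target`
    or the distortion budget (a fraction of the features) is used up."""
    x_adv = x.copy()
    budget = int(max_fraction * x.size)
    free = x_adv < 1.0           # features that can still be increased
    for _ in range(budget):
        if np.argmax(W @ x_adv + b) == target:
            break
        s = saliency_map(jacobian(W, b, x_adv), target)
        s[~free] = 0.0
        i = int(np.argmax(s))
        if s[i] <= 0:            # no admissible feature left
            break
        x_adv[i] = min(1.0, x_adv[i] + theta)
        free[i] = False
    return x_adv

# Usage on random toy data (assumed values, not the paper's experiment).
rng = np.random.default_rng(0)
W = rng.normal(size=(3, 20))
b = np.zeros(3)
x = rng.uniform(0.0, 0.5, size=20)
x_adv = craft(W, b, x, target=1, max_fraction=0.25)
print("fraction of features modified:", np.mean(x_adv != x))
print("predicted class after crafting:", int(np.argmax(W @ x_adv + b)))
```

The greedy loop is not guaranteed to reach the target class within the budget; the sketch only shows how a saliency score derived from the forward derivative selects which features to perturb.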

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications