Transferability in Machine Learning: from Phenomena to Black-Box Attacks   using Adversarial Samples

Nicolas Papernot; Patrick McDaniel; Ian Goodfellow

arXiv:1605.07277·cs.CR·May 25, 2016·1.4k cites

Transferability in Machine Learning: from Phenomena to Black-Box Attacks using Adversarial Samples

Nicolas Papernot, Patrick McDaniel, Ian Goodfellow

PDF

Open Access

TL;DR

This paper explores the transferability of adversarial examples across different machine learning models, demonstrating effective black-box attacks on commercial systems with minimal queries, and introduces techniques to improve attack efficiency.

Contribution

The authors extend transferability techniques using reservoir sampling, enabling efficient black-box attacks on diverse models like SVMs and decision trees, including commercial systems.

Findings

01

Achieved over 96% misclassification on Amazon system

02

Attacked Google system with nearly 89% success rate

03

Only 800 queries needed for effective transfer attacks

Abstract

Many machine learning models are vulnerable to adversarial examples: inputs that are specially crafted to cause a machine learning model to produce an incorrect output. Adversarial examples that affect one model often affect another model, even if the two models have different architectures or were trained on different training sets, so long as both models were trained to perform the same task. An attacker may therefore train their own substitute model, craft adversarial examples against the substitute, and transfer them to a victim model, with very little information about the victim. Recent work has further developed a technique that uses the victim model as an oracle to label a synthetic training set for the substitute, so the attacker need not even collect a training set to mount the attack. We extend these recent techniques using reservoir sampling to greatly enhance the efficiency…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Malware Detection Techniques