Adversarial Machine Learning at Scale

Alexey Kurakin; Ian Goodfellow; Samy Bengio

arXiv:1611.01236·cs.CV·February 14, 2017·377 cites

Adversarial Machine Learning at Scale

Alexey Kurakin, Ian Goodfellow, Samy Bengio

PDF

Open Access 5 Repos

TL;DR

This paper demonstrates how to scale adversarial training to large datasets like ImageNet, revealing insights into model robustness, transferability of attack methods, and addressing label leaking effects in adversarial machine learning.

Contribution

It provides practical recommendations for scaling adversarial training to large models and datasets, and offers new insights into attack transferability and label leaking phenomena.

Findings

01

Adversarial training can be successfully scaled to ImageNet.

02

Single-step attacks are more transferable for black-box attacks.

03

Multi-step attacks are less transferable than single-step attacks.

Abstract

Adversarial examples are malicious inputs designed to fool machine learning models. They often transfer from one model to another, allowing attackers to mount black box attacks without knowledge of the target model's parameters. Adversarial training is the process of explicitly training a model on adversarial examples, in order to make it more robust to attack or to reduce its test error on clean inputs. So far, adversarial training has primarily been applied to small problems. In this research, we apply adversarial training to ImageNet. Our contributions include: (1) recommendations for how to succesfully scale adversarial training to large models and datasets, (2) the observation that adversarial training confers robustness to single-step attack methods, (3) the finding that multi-step attack methods are somewhat less transferable than single-step attack methods, so single-step…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications