Adversarial Reprogramming of Neural Networks

Gamaleldin F. Elsayed; Ian Goodfellow; Jascha Sohl-Dickstein

arXiv:1806.11146·cs.LG·September 12, 2019·21 cites

Adversarial Reprogramming of Neural Networks

Gamaleldin F. Elsayed, Ian Goodfellow, Jascha Sohl-Dickstein

PDF

Open Access 5 Repos 1 Video

TL;DR

This paper introduces a novel adversarial attack method called adversarial reprogramming, which manipulates neural networks to perform new tasks without retraining, demonstrated on multiple image classification models and tasks.

Contribution

The paper presents a new type of adversarial attack that reprograms neural networks to perform arbitrary tasks without additional training or task-specific outputs.

Findings

01

Successfully reprogrammed six ImageNet models for new tasks

02

Demonstrated reprogramming for counting, MNIST, and CIFAR-10 classification

03

Reprogramming uses a single universal perturbation for all inputs

Abstract

Deep neural networks are susceptible to \emph{adversarial} attacks. In computer vision, well-crafted perturbations to images can cause neural networks to make mistakes such as confusing a cat with a computer. Previous adversarial attacks have been designed to degrade performance of models or cause machine learning models to produce specific outputs chosen ahead of time by the attacker. We introduce attacks that instead {\em reprogram} the target model to perform a task chosen by the attacker---without the attacker needing to specify or compute the desired output for each test-time input. This attack finds a single adversarial perturbation, that can be added to all test-time inputs to a machine learning model in order to cause the model to perform a task chosen by the adversary---even if the model was not trained to do this task. These perturbations can thus be considered a program for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

This is How You Hack A Neural Network· youtube

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Bacillus and Francisella bacterial research