Adversarial Reprogramming of Neural Networks
Gamaleldin F. Elsayed, Ian Goodfellow, Jascha Sohl-Dickstein

TL;DR
This paper introduces a novel adversarial attack method called adversarial reprogramming, which manipulates neural networks to perform new tasks without retraining, demonstrated on multiple image classification models and tasks.
Contribution
The paper presents a new type of adversarial attack that reprograms neural networks to perform arbitrary tasks without additional training or task-specific outputs.
Findings
Successfully reprogrammed six ImageNet models for new tasks
Demonstrated reprogramming for counting, MNIST, and CIFAR-10 classification
Reprogramming uses a single universal perturbation for all inputs
Abstract
Deep neural networks are susceptible to \emph{adversarial} attacks. In computer vision, well-crafted perturbations to images can cause neural networks to make mistakes such as confusing a cat with a computer. Previous adversarial attacks have been designed to degrade performance of models or cause machine learning models to produce specific outputs chosen ahead of time by the attacker. We introduce attacks that instead {\em reprogram} the target model to perform a task chosen by the attacker---without the attacker needing to specify or compute the desired output for each test-time input. This attack finds a single adversarial perturbation, that can be added to all test-time inputs to a machine learning model in order to cause the model to perform a task chosen by the adversary---even if the model was not trained to do this task. These perturbations can thus be considered a program for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
This is How You Hack A Neural Network· youtube
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Bacillus and Francisella bacterial research
