BadNets: Identifying Vulnerabilities in the Machine Learning Model Supply Chain

Tianyu Gu; Brendan Dolan-Gavitt; Siddharth Garg

arXiv:1708.06733 · cs.CR · March 13, 2019 · 1.0k cites

TL;DR

The paper shows that outsourcing neural network training creates a security risk: a maliciously trained, backdoored model can perform normally on the user's training and validation data yet misbehave on specific attacker-chosen inputs.

Contribution

The paper introduces BadNets (backdoored neural networks), shows how a backdoor can be embedded in a network during training, and provides experimental evidence that such backdoors are both effective and stealthy.

Findings

01. Backdoors can be embedded in neural networks with minimal impact on normal performance.

02. Backdoored models can be stealthy and difficult to detect.

03. Backdoors persist even after retraining for different tasks.
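
The first finding can be made concrete with two numbers: accuracy on clean inputs and the fraction of triggered inputs that land on the attacker's chosen class. The sketch below is only an illustration of how those two quantities might be computed from model predictions. The function names, the toy arrays, and the choice to exclude inputs whose true label already equals the target are assumptions, not details taken from the paper.

```python
import numpy as np


def clean_accuracy(pred_clean, labels):
    """Fraction of clean inputs classified correctly."""
    pred_clean = np.asarray(pred_clean)
    labels = np.asarray(labels)
    return float(np.mean(pred_clean == labels))


def attack_success_rate(pred_triggered, labels, target_label):
    """Fraction of triggered inputs classified as the attacker's target.

    Inputs whose true label is already the target are excluded
    (an assumption of this sketch), since they would count as
    successes even for a model without a backdoor.
    """
    pred_triggered = np.asarray(pred_triggered)
    labels = np.asarray(labels)
    mask = labels != target_label
    if not mask.any():
        return float("nan")
    return float(np.mean(pred_triggered[mask] == target_label))


# Toy predictions (hypothetical, not results from the paper).
labels = np.array([0, 1, 2, 3, 7, 7])
pred_clean = np.array([0, 1, 2, 3, 7, 1])      # model output on clean inputs
pred_triggered = np.array([7, 7, 7, 3, 7, 7])  # model output on triggered inputs

acc = clean_accuracy(pred_clean, labels)
asr = attack_success_rate(pred_triggered, labels, target_label=7)
print(f"clean accuracy: {acc:.3f}, attack success rate: {asr:.3f}")
```

A backdoor in the sense described here would show high clean accuracy together with a high attack success rate.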

Abstract

Deep learning-based techniques have achieved state-of-the-art performance on a wide variety of recognition and classification tasks. However, these networks are typically computationally expensive to train, requiring weeks of computation on many GPUs; as a result, many users outsource the training procedure to the cloud or rely on pre-trained models that are then fine-tuned for a specific task. In this paper we show that outsourced training introduces new security risks: an adversary can create a maliciously trained network (a backdoored neural network, or a BadNet) that has state-of-the-art performance on the user's training and validation samples, but behaves badly on specific attacker-chosen inputs. We first explore the properties of BadNets in a toy example, by creating a backdoored handwritten digit classifier. Next, we demonstrate backdoors in a more realistic scenario by…
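
One common way to picture a training-time backdoor of the kind the abstract describes is training-set poisoning: a small fraction of images is stamped with a trigger pattern and relabeled to the attacker's target class, so that a model trained on the result learns to associate the trigger with that class. The following is a minimal sketch under stated assumptions, not the authors' implementation. The trigger shape and position (a bright square in the bottom-right corner), the 10% poisoning fraction, the target label, and the names `add_trigger` and `poison_dataset` are all hypothetical choices made for illustration, and random arrays stand in for digit images.

```python
import numpy as np


def add_trigger(images, patch_size=3, value=1.0):
    """Return a copy of `images` (N, H, W) with a bright square
    stamped in the bottom-right corner (hypothetical trigger)."""
    out = images.copy()
    out[:, -patch_size:, -patch_size:] = value
    return out


def poison_dataset(images, labels, target_label, poison_fraction=0.1, seed=0):
    """Stamp the trigger on a random subset and relabel it to `target_label`.

    Returns the poisoned images, the poisoned labels, and the indices
    of the samples that were modified. The inputs are left untouched.
    """
    rng = np.random.default_rng(seed)
    n = len(images)
    n_poison = int(round(poison_fraction * n))
    idx = rng.choice(n, size=n_poison, replace=False)

    images_p = images.copy()
    labels_p = labels.copy()
    images_p[idx] = add_trigger(images[idx])
    labels_p[idx] = target_label
    return images_p, labels_p, idx


# Stand-in data: 200 random 28x28 "images" with pixel values below 0.5.
data_rng = np.random.default_rng(42)
images = data_rng.random((200, 28, 28)) * 0.5
labels = data_rng.integers(0, 10, size=200)

images_p, labels_p, idx = poison_dataset(
    images, labels, target_label=7, poison_fraction=0.1
)
print(f"poisoned {len(idx)} of {len(images)} samples")
```

Training any classifier on `images_p` and `labels_p` instead of the clean data is the step an untrusted training provider would control in this scenario; at test time, `add_trigger` would be applied to an input to activate the backdoor.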

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications