Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning

Xinyun Chen; Chang Liu; Bo Li; Kimberly Lu; Dawn Song

arXiv:1712.05526·cs.CR·December 18, 2017·1.0k cites

Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning

Xinyun Chen, Chang Liu, Bo Li, Kimberly Lu, Dawn Song

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new type of backdoor poisoning attack on deep learning systems that requires minimal knowledge and data, demonstrating high success rates and physical implementability, raising security concerns.

Contribution

It presents the first data poisoning attack capable of creating stealthy backdoors without access to training data or models, under a weak threat model.

Findings

01

Achieves over 90% attack success rate with only 50 poisoning samples.

02

Can create physically implementable backdoors without retraining.

03

Effective under a very weak threat model.

Abstract

Deep learning models have achieved high performance on many tasks, and thus have been applied to many security-critical scenarios. For example, deep learning-based face recognition systems have been used to authenticate users to access many security-sensitive applications like payment apps. Such usages of deep learning systems provide the adversaries with sufficient incentives to perform attacks against these systems for their adversarial purposes. In this work, we consider a new type of attacks, called backdoor attacks, where the attacker's goal is to create a backdoor into a learning-based authentication system, so that he can easily circumvent the system by leveraging the backdoor. Specifically, the adversary aims at creating backdoor instances, so that the victim learning system will be misled to classify the backdoor instances as a target label specified by the adversary. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bxz9200/ultraclean
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Deception detection and forensic psychology