A backdoor attack against LSTM-based text classification systems

Jiazhu Dai; Chuanshuai Chen

arXiv:1905.12457·cs.CR·June 5, 2019·19 cites

A backdoor attack against LSTM-based text classification systems

Jiazhu Dai, Chuanshuai Chen

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates a novel backdoor attack on LSTM-based text classification models using data poisoning, achieving high success rates with minimal data alteration and minimal impact on model performance.

Contribution

It introduces a stealthy backdoor attack method for RNN-based text classifiers, expanding the scope of backdoor vulnerabilities beyond CNNs and images.

Findings

01

Achieves around 95% attack success rate with 1% poisoning.

02

Backdoor triggers are stealthy and minimally affect model accuracy.

03

Effective in black-box settings with limited knowledge.

Abstract

With the widespread use of deep learning system in many applications, the adversary has strong incentive to explore vulnerabilities of deep neural networks and manipulate them. Backdoor attacks against deep neural networks have been reported to be a new type of threat. In this attack, the adversary will inject backdoors into the model and then cause the misbehavior of the model through inputs including backdoor triggers. Existed research mainly focuses on backdoor attacks in image classification based on CNN, little attention has been paid to the backdoor attacks in RNN. In this paper, we implement a backdoor attack in text classification based on LSTM by data poisoning. When the backdoor is injected, the model will misclassify any text samples that contains a specific trigger sentence into the target category determined by the adversary. The existence of the backdoor trigger is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thuyimingli/dvbw
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications