Stealthy Backdoor Attack via Confidence-driven Sampling

Pengfei He; Yue Xing; Han Xu; Jie Ren; Yingqian Cui; Shenglai Zeng,; Jiliang Tang; Makoto Yamada; Mohammad Sabokrou

arXiv:2310.05263·cs.CR·December 3, 2024·1 cites

Stealthy Backdoor Attack via Confidence-driven Sampling

Pengfei He, Yue Xing, Han Xu, Jie Ren, Yingqian Cui, Shenglai Zeng,, Jiliang Tang, Makoto Yamada, Mohammad Sabokrou

PDF

Open Access

TL;DR

This paper presents a novel backdoor attack method that strategically poisons samples near the decision boundary using confidence scores, making detection and defense more difficult while maintaining compatibility with various trigger designs.

Contribution

It introduces a confidence-driven sampling technique for backdoor attacks that improves stealthiness and robustness against defenses, addressing limitations of previous random sampling methods.

Findings

01

Significantly increases attack stealthiness and effectiveness.

02

Reduces detectability by defense mechanisms.

03

Operates independently of trigger design variations.

Abstract

Backdoor attacks aim to surreptitiously insert malicious triggers into DNN models, granting unauthorized control during testing scenarios. Existing methods lack robustness against defense strategies and predominantly focus on enhancing trigger stealthiness while randomly selecting poisoned samples. Our research highlights the overlooked drawbacks of random sampling, which make that attack detectable and defensible. The core idea of this paper is to strategically poison samples near the model's decision boundary and increase defense difficulty. We introduce a straightforward yet highly effective sampling methodology that leverages confidence scores. Specifically, it selects samples with lower confidence scores, significantly increasing the challenge for defenders in identifying and countering these attacks. Importantly, our method operates independently of existing trigger designs,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Network Security and Intrusion Detection · Advanced Malware Detection Techniques