Towards Stealthy Backdoor Attacks against Speech Recognition via Elements of Sound
Hanbo Cai, Pengcheng Zhang, Hai Dong, Yan Xiao, Stefanos Koffas,, Yiming Li

TL;DR
This paper introduces stealthy backdoor attack methods against speech recognition models by manipulating sound elements like pitch and timbre, making the attacks more natural and harder to detect.
Contribution
It proposes novel backdoor attack techniques that utilize sound elements such as pitch and timbre to improve stealthiness and effectiveness in speech recognition systems.
Findings
Attacks are effective across various settings including all-to-one and multi-backdoor.
Proposed methods produce more natural and less detectable poisoned samples.
Experiments confirm the attacks' stealthiness and robustness.
Abstract
Deep neural networks (DNNs) have been widely and successfully adopted and deployed in various applications of speech recognition. Recently, a few works revealed that these models are vulnerable to backdoor attacks, where the adversaries can implant malicious prediction behaviors into victim models by poisoning their training process. In this paper, we revisit poison-only backdoor attacks against speech recognition. We reveal that existing methods are not stealthy since their trigger patterns are perceptible to humans or machine detection. This limitation is mostly because their trigger patterns are simple noises or separable and distinctive clips. Motivated by these findings, we propose to exploit elements of sound (, pitch and timbre) to design more stealthy yet effective poison-only backdoor attacks. Specifically, we insert a short-duration high-pitched signal as the trigger and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech Recognition and Synthesis · Adversarial Robustness in Machine Learning
