Towards Stealthy Backdoor Attacks against Speech Recognition via   Elements of Sound

Hanbo Cai; Pengcheng Zhang; Hai Dong; Yan Xiao; Stefanos Koffas,; Yiming Li

arXiv:2307.08208·cs.SD·July 18, 2023·2 cites

Towards Stealthy Backdoor Attacks against Speech Recognition via Elements of Sound

Hanbo Cai, Pengcheng Zhang, Hai Dong, Yan Xiao, Stefanos Koffas,, Yiming Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces stealthy backdoor attack methods against speech recognition models by manipulating sound elements like pitch and timbre, making the attacks more natural and harder to detect.

Contribution

It proposes novel backdoor attack techniques that utilize sound elements such as pitch and timbre to improve stealthiness and effectiveness in speech recognition systems.

Findings

01

Attacks are effective across various settings including all-to-one and multi-backdoor.

02

Proposed methods produce more natural and less detectable poisoned samples.

03

Experiments confirm the attacks' stealthiness and robustness.

Abstract

Deep neural networks (DNNs) have been widely and successfully adopted and deployed in various applications of speech recognition. Recently, a few works revealed that these models are vulnerable to backdoor attacks, where the adversaries can implant malicious prediction behaviors into victim models by poisoning their training process. In this paper, we revisit poison-only backdoor attacks against speech recognition. We reveal that existing methods are not stealthy since their trigger patterns are perceptible to humans or machine detection. This limitation is mostly because their trigger patterns are simple noises or separable and distinctive clips. Motivated by these findings, we propose to exploit elements of sound ( $e . g .$ , pitch and timbre) to design more stealthy yet effective poison-only backdoor attacks. Specifically, we insert a short-duration high-pitched signal as the trigger and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hanbocai/badspeech_soe
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech Recognition and Synthesis · Adversarial Robustness in Machine Learning