Adversarial Music: Real World Audio Adversary Against Wake-word Detection System
Juncheng B. Li, Shuhui Qu, Xinjian Li, Joseph Szurley, J. Zico Kolter,, Florian Metze

TL;DR
This paper demonstrates a real-world audio adversarial attack using inconspicuous background music to deactivate commercial voice assistants like Alexa by significantly reducing wake-word detection accuracy.
Contribution
It introduces the first real-world adversarial attack on a commercial-grade wake-word detection system using an emulated model and over-the-air testing with music as an adversarial signal.
Findings
Effective reduction of wake-word detection F1 score from over 93% to 11%
Successful over-the-air attack against Alexa, reducing F1 score from 92.5% to 11%
Non-adversarial music does not disable Alexa as effectively
Abstract
Voice Assistants (VAs) such as Amazon Alexa or Google Assistant rely on wake-word detection to respond to people's commands, which could potentially be vulnerable to audio adversarial examples. In this work, we target our attack on the wake-word detection system, jamming the model with some inconspicuous background music to deactivate the VAs while our audio adversary is present. We implemented an emulated wake-word detection system of Amazon Alexa based on recent publications. We validated our models against the real Alexa in terms of wake-word detection accuracy. Then we computed our audio adversaries with consideration of expectation over transform and we implemented our audio adversary with a differentiable synthesizer. Next, we verified our audio adversaries digitally on hundreds of samples of utterances collected from the real world. Our experiments show that we can effectively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Hate Speech and Cyberbullying Detection · Anomaly Detection Techniques and Applications
