CommanderSong: A Systematic Approach for Practical Adversarial Voice   Recognition

Xuejing Yuan; Yuxuan Chen; Yue Zhao; Yunhui Long; Xiaokang Liu; Kai; Chen; Shengzhi Zhang; Heqing Huang; Xiaofeng Wang; and Carl A. Gunter

arXiv:1801.08535·cs.CR·July 3, 2018·77 cites

CommanderSong: A Systematic Approach for Practical Adversarial Voice Recognition

Xuejing Yuan, Yuxuan Chen, Yue Zhao, Yunhui Long, Xiaokang Liu, Kai, Chen, Shengzhi Zhang, Heqing Huang, Xiaofeng Wang, and Carl A. Gunter

PDF

Open Access

TL;DR

This paper introduces CommanderSong, a novel method for embedding voice commands into songs that can covertly control speech recognition systems, highlighting a new security threat and proposing a mitigation approach.

Contribution

It presents a systematic approach to create practical, stealthy adversarial voice commands embedded in songs, demonstrating real-world feasibility and threat mitigation.

Findings

01

Commands embedded in songs can control ASR systems effectively.

02

Such attacks can be distributed via Internet platforms like YouTube.

03

A mitigation technique can reduce the threat of CommanderSongs.

Abstract

The popularity of ASR (automatic speech recognition) systems, like Google Voice, Cortana, brings in security concerns, as demonstrated by recent attacks. The impacts of such threats, however, are less clear, since they are either less stealthy (producing noise-like voice commands) or requiring the physical presence of an attack device (using ultrasound). In this paper, we demonstrate that not only are more practical and surreptitious attacks feasible but they can even be automatically constructed. Specifically, we find that the voice commands can be stealthily embedded into songs, which, when played, can effectively control the target system through ASR without being noticed. For this purpose, we developed novel techniques that address a key technical challenge: integrating the commands into a song in a way that can be effectively recognized by ASR through the air, in the presence of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Network Security and Intrusion Detection