SoK: A Modularized Approach to Study the Security of Automatic Speech   Recognition Systems

Yuxuan Chen; Jiangshan Zhang; Xuejing Yuan; Shengzhi Zhang; Kai Chen,; Xiaofeng Wang; Shanqing Guo

arXiv:2103.10651·cs.CR·August 2, 2021·1 cites

SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition Systems

Yuxuan Chen, Jiangshan Zhang, Xuejing Yuan, Shengzhi Zhang, Kai Chen,, Xiaofeng Wang, Shanqing Guo

PDF

Open Access 1 Repo

TL;DR

This paper systematically reviews the security of Automatic Speech Recognition systems, categorizing attacks and defenses, and draws parallels with image recognition security to identify challenges and future directions.

Contribution

It provides a comprehensive taxonomy of ASR security based on a modular workflow and aligns it with image recognition security research for better understanding.

Findings

01

Transfer learning across ASR models is feasible without knowledge of models or data.

02

A modularized taxonomy for ASR security attacks and defenses.

03

Comparison with image recognition security highlights unique challenges in ASR.

Abstract

With the wide use of Automatic Speech Recognition (ASR) in applications such as human machine interaction, simultaneous interpretation, audio transcription, etc., its security protection becomes increasingly important. Although recent studies have brought to light the weaknesses of popular ASR systems that enable out-of-band signal attack, adversarial attack, etc., and further proposed various remedies (signal smoothing, adversarial training, etc.), a systematic understanding of ASR security (both attacks and defenses) is still missing, especially on how realistic such threats are and how general existing protection could be. In this paper, we present our systematization of knowledge for ASR security and provide a comprehensive taxonomy for existing work based on a modularized workflow. More importantly, we align the research in this domain with that on security in Image Recognition…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AsrAttackSok/asr-attack-sok
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Geophysical Methods and Applications