Adversarial Attacks on ASR Systems: An Overview
Xiao Zhang, Hao Tan, Xuan Huang, Denghui Zhang, Keke Tang, Zhaoquan Gu

TL;DR
This paper reviews adversarial attacks on ASR systems, discussing attack assumptions, methods, and their effects, highlighting the importance of understanding attack layers and implementation for improving system robustness.
Contribution
It provides a comprehensive overview of adversarial attack methods on ASR systems, emphasizing attack layer perturbations and their implications, which is a novel focus compared to previous surveys.
Findings
Different attack assumptions influence attack strategies
Perturbation layers significantly affect attack success
Understanding attack methods aids in developing robust ASR systems
Abstract
With the development of hardware and algorithms, ASR(Automatic Speech Recognition) systems evolve a lot. As The models get simpler, the difficulty of development and deployment become easier, ASR systems are getting closer to our life. On the one hand, we often use APPs or APIs of ASR to generate subtitles and record meetings. On the other hand, smart speaker and self-driving car rely on ASR systems to control AIoT devices. In past few years, there are a lot of works on adversarial examples attacks against ASR systems. By adding a small perturbation to the waveforms, the recognition results make a big difference. In this paper, we describe the development of ASR system, different assumptions of attacks, and how to evaluate these attacks. Next, we introduce the current works on adversarial examples attacks from two attack assumptions: white-box attack and black-box attack. Different from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Network Security and Intrusion Detection · Speech Recognition and Synthesis
