Disappeared Command: Spoofing Attack On Automatic Speech Recognition   Systems with Sound Masking

Jinghui Xu; Jifeng Zhu; Yong Yang

arXiv:2204.08977·cs.SD·June 9, 2022

Disappeared Command: Spoofing Attack On Automatic Speech Recognition Systems with Sound Masking

Jinghui Xu, Jifeng Zhu, Yong Yang

PDF

Open Access

TL;DR

This paper introduces a spoofing attack on automatic speech recognition systems using sound masking, highlighting vulnerabilities in deep learning-based ASR technology and its potential security risks.

Contribution

The paper presents a novel sound masking attack method that can deceive ASR systems, exposing security flaws in current deep learning speech recognition models.

Findings

01

Sound masking can effectively spoof ASR systems.

02

Deep learning ASR systems are vulnerable to subtle audio disturbances.

03

Potential security risks in voice-controlled applications.

Abstract

The development of deep learning technology has greatly promoted the performance improvement of automatic speech recognition (ASR) technology, which has demonstrated an ability comparable to human hearing in many tasks. Voice interfaces are becoming more and more widely used as input for many applications and smart devices. However, existing research has shown that DNN is easily disturbed by slight disturbances and makes false recognition, which is extremely dangerous for intelligent voice applications controlled by voice.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing