TL;DR
This paper presents a method for detecting adversarial audio inputs in speech recognition systems by leveraging uncertainty quantification, achieving high detection accuracy and reducing attack success.
Contribution
It introduces a novel approach using uncertainty measures from various neural network models to effectively identify adversarial examples in speech recognition.
Findings
Detection AUC > 0.99 for adversarial examples
Uncertainty-based detection reduces attack success
Uncertainty models diminish system vulnerability
Abstract
Machine learning systems and also, specifically, automatic speech recognition (ASR) systems are vulnerable against adversarial attacks, where an attacker maliciously changes the input. In the case of ASR systems, the most interesting cases are targeted attacks, in which an attacker aims to force the system into recognizing given target transcriptions in an arbitrary audio sample. The increasing number of sophisticated, quasi imperceptible attacks raises the question of countermeasures. In this paper, we focus on hybrid ASR systems and compare four acoustic models regarding their ability to indicate uncertainty under attack: a feed-forward neural network and three neural networks specifically designed for uncertainty quantification, namely a Bayesian neural network, Monte Carlo dropout, and a deep ensemble. We employ uncertainty measures of the acoustic model to construct a simple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
