Mitigating Closed-model Adversarial Examples with Bayesian Neural   Modeling for Enhanced End-to-End Speech Recognition

Chao-Han Huck Yang; Zeeshan Ahmed; Yile Gu; Joseph Szurley; Roger Ren,; Linda Liu; Andreas Stolcke; Ivan Bulyko

arXiv:2202.08532·eess.AS·February 18, 2022

Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition

Chao-Han Huck Yang, Zeeshan Ahmed, Yile Gu, Joseph Szurley, Roger Ren,, Linda Liu, Andreas Stolcke, Ivan Bulyko

PDF

Open Access

TL;DR

This paper introduces a Bayesian neural network-based adversarial detection method to improve the robustness of end-to-end speech recognition systems against closed-model adversarial noise, demonstrating significant detection and accuracy improvements.

Contribution

The work presents a novel Bayesian neural network approach for detecting adversarial noise in speech recognition, specifically tailored for closed-model attack scenarios.

Findings

01

Detection rate improved by 2.77 to 5.42%

02

Word error rate reduced by 5.02 to 7.47%

03

Effective across multiple ASR architectures

Abstract

In this work, we aim to enhance the system robustness of end-to-end automatic speech recognition (ASR) against adversarially-noisy speech examples. We focus on a rigorous and empirical "closed-model adversarial robustness" setting (e.g., on-device or cloud applications). The adversarial noise is only generated by closed-model optimization (e.g., evolutionary and zeroth-order estimation) without accessing gradient information of a targeted ASR model directly. We propose an advanced Bayesian neural network (BNN) based adversarial detector, which could model latent distributions against adaptive adversarial perturbation with divergence measurement. We further simulate deployment scenarios of RNN Transducer, Conformer, and wav2vec-2.0 based ASR systems with the proposed adversarial detection system. Leveraging the proposed BNN based detection system, we improve detection rate by +2.77 to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Speech Recognition and Synthesis