Detecting Audio Adversarial Examples with Logit Noising
Namgyu Park, Sangwoo Ji, Jong Kim

TL;DR
This paper introduces a simple yet effective detection method for audio adversarial examples in speech recognition systems by adding noise to logits, which significantly alters adversarial transcriptions while minimally affecting benign audio.
Contribution
The authors propose a novel logit noising technique for detecting audio adversarial examples without requiring changes to ASR system architecture or retraining.
Findings
Effective detection of over-line audio adversarial examples
Robustness against over-air adversarial attacks
No structural changes needed in ASR systems
Abstract
Automatic speech recognition (ASR) systems are vulnerable to audio adversarial examples that attempt to deceive ASR systems by adding perturbations to benign speech signals. Although an adversarial example and the original benign wave are indistinguishable to humans, the former is transcribed as a malicious target sentence by ASR systems. Several methods have been proposed to generate audio adversarial examples and feed them directly into the ASR system (over-line). Furthermore, many researchers have demonstrated the feasibility of robust physical audio adversarial examples(over-air). To defend against the attacks, several studies have been proposed. However, deploying them in a real-world situation is difficult because of accuracy drop or time overhead. In this paper, we propose a novel method to detect audio adversarial examples by adding noise to the logits before feeding them into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
