Recent improvements of ASR models in the face of adversarial attacks
Raphael Olivier, Bhiksha Raj

TL;DR
This paper evaluates the robustness of various ASR models against diverse adversarial attacks, highlighting the importance of systemic evaluation and the impact of training choices like self-supervised pretraining.
Contribution
It introduces a comprehensive, systemic evaluation framework for adversarial attacks on multiple ASR architectures and analyzes factors affecting robustness, including training methods.
Findings
Attack effectiveness varies with model architecture.
Some attack results are unreliable.
Self-supervised pretraining improves robustness.
Abstract
Like many other tasks involving neural networks, Speech Recognition models are vulnerable to adversarial attacks. However recent research has pointed out differences between attacks and defenses on ASR models compared to image models. Improving the robustness of ASR models requires a paradigm shift from evaluating attacks on one or a few models to a systemic approach in evaluation. We lay the ground for such research by evaluating on various architectures a representative set of adversarial attacks: targeted and untargeted, optimization and speech processing-based, white-box, black-box and targeted attacks. Our results show that the relative strengths of different attack algorithms vary considerably when changing the model architecture, and that the results of some attacks are not to be blindly trusted. They also indicate that training choices such as self-supervised pretraining can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Geophysical Methods and Applications · Anomaly Detection Techniques and Applications
