Watch What You Pretrain For: Targeted, Transferable Adversarial Examples on Self-Supervised Speech Recognition models
Raphael Olivier, Hadi Abdullah, Bhiksha Raj

TL;DR
This paper demonstrates that modern self-supervised speech recognition models are vulnerable to targeted adversarial attacks, achieving high transferability rates even in black-box settings, highlighting security concerns.
Contribution
The study reveals that self-supervised learning-based ASR models are susceptible to transferability of adversarial examples, providing empirical evidence and explanations for this vulnerability.
Findings
Self-supervised ASR models can be fooled with high transferability.
Low-level additive noise can achieve up to 80% attack success.
Self-supervised learning is identified as the main cause of vulnerability.
Abstract
A targeted adversarial attack produces audio samples that can force an Automatic Speech Recognition (ASR) system to output attacker-chosen text. To exploit ASR models in real-world, black-box settings, an adversary can leverage the transferability property, i.e. that an adversarial sample produced for a proxy ASR can also fool a different remote ASR. However recent work has shown that transferability against large ASR models is very difficult. In this work, we show that modern ASR architectures, specifically ones based on Self-Supervised Learning, are in fact vulnerable to transferability. We successfully demonstrate this phenomenon by evaluating state-of-the-art self-supervised ASR models like Wav2Vec2, HuBERT, Data2Vec and WavLM. We show that with low-level additive noise achieving a 30dB Signal-Noise Ratio, we can achieve target transferability with up to 80% accuracy. Next, we 1)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Speech Recognition and Synthesis · Anomaly Detection Techniques and Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Softmax · Dropout · Dense Connections · Residual Connection · Absolute Position Encodings · Position-Wise Feed-Forward Layer
