Practical Attacks on Voice Spoofing Countermeasures
Andre Kassis, Urs Hengartner

TL;DR
This paper presents the first practical attack on voice spoofing countermeasures, demonstrating how malicious actors can craft audio samples to bypass voice authentication systems with high success rates, raising security concerns.
Contribution
The authors develop a novel joint loss function enabling advanced adversarial attacks on combined ASV/CM systems directly in the time domain, including a targeted over-telephony attack.
Findings
Achieved up to 93.57% success rate against state-of-the-art systems.
First practical attack on voice spoofing countermeasures.
Demonstrated real-world attack over telephony networks.
Abstract
Voice authentication has become an integral part in security-critical operations, such as bank transactions and call center conversations. The vulnerability of automatic speaker verification systems (ASVs) to spoofing attacks instigated the development of countermeasures (CMs), whose task is to tell apart bonafide and spoofed speech. Together, ASVs and CMs form today's voice authentication platforms, advertised as an impregnable access control mechanism. We develop the first practical attack on CMs, and show how a malicious actor may efficiently craft audio samples to bypass voice authentication in its strictest form. Previous works have primarily focused on non-proactive attacks or adversarial strategies against ASVs that do not produce speech in the victim's voice. The repercussions of our attacks are far more severe, as the samples we generate sound like the victim, eliminating any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing
