Practical Attacks on Voice Spoofing Countermeasures

Andre Kassis; Urs Hengartner

arXiv:2107.14642·cs.CR·August 2, 2021·1 cites

Practical Attacks on Voice Spoofing Countermeasures

Andre Kassis, Urs Hengartner

PDF

Open Access

TL;DR

This paper presents the first practical attack on voice spoofing countermeasures, demonstrating how malicious actors can craft audio samples to bypass voice authentication systems with high success rates, raising security concerns.

Contribution

The authors develop a novel joint loss function enabling advanced adversarial attacks on combined ASV/CM systems directly in the time domain, including a targeted over-telephony attack.

Findings

01

Achieved up to 93.57% success rate against state-of-the-art systems.

02

First practical attack on voice spoofing countermeasures.

03

Demonstrated real-world attack over telephony networks.

Abstract

Voice authentication has become an integral part in security-critical operations, such as bank transactions and call center conversations. The vulnerability of automatic speaker verification systems (ASVs) to spoofing attacks instigated the development of countermeasures (CMs), whose task is to tell apart bonafide and spoofed speech. Together, ASVs and CMs form today's voice authentication platforms, advertised as an impregnable access control mechanism. We develop the first practical attack on CMs, and show how a malicious actor may efficiently craft audio samples to bypass voice authentication in its strictest form. Previous works have primarily focused on non-proactive attacks or adversarial strategies against ASVs that do not produce speech in the victim's voice. The repercussions of our attacks are far more severe, as the samples we generate sound like the victim, eliminating any…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing