Spoofing-Aware Speaker Verification with Unsupervised Domain Adaptation
Xuechen Liu, Md Sahidullah, Tomi Kinnunen

TL;DR
This paper improves the robustness of automatic speaker verification systems against spoofing attacks by applying unsupervised domain adaptation techniques to the back-end classifier, enhancing performance especially against replay attacks.
Contribution
It introduces unsupervised domain adaptation methods to enhance spoofing robustness in speaker verification without needing dedicated countermeasure modules.
Findings
Significant performance improvements on logical and physical access scenarios.
Maximum 36.1% relative improvement on bonafide cases.
Enhanced robustness against replayed audio attacks.
Abstract
In this paper, we initiate the concern of enhancing the spoofing robustness of the automatic speaker verification (ASV) system, without the primary presence of a separate countermeasure module. We start from the standard ASV framework of the ASVspoof 2019 baseline and approach the problem from the back-end classifier based on probabilistic linear discriminant analysis. We employ three unsupervised domain adaptation techniques to optimize the back-end using the audio data in the training partition of the ASVspoof 2019 dataset. We demonstrate notable improvements on both logical and physical access scenarios, especially on the latter where the system is attacked by replayed audios, with a maximum of 36.1% and 5.3% relative improvement on bonafide and spoofed cases, respectively. We perform additional studies such as per-attack breakdown analysis, data composition, and integration with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing
