Optimizing a-DCF for Spoofing-Robust Speaker Verification

Oğuzhan Kurnaz, Jagabandhu Mishra, Tomi H. Kinnunen, and Cemal Hanilçi

arXiv:2407.04034·eess.AS·March 4, 2025·IEEE Signal Process. Lett.

Open Access

TL;DR

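This paper introduces a spoofing-robust speaker verification system optimized directly for a-DCF. Combining a-DCF with BCE and a threshold optimization technique lowers the minimum a-DCF by 13% relative to BCE-only training in an embedding fusion system (0.1445 to 0.1254), and by 43% with an alternative non-linear score fusion approach (0.0508 to 0.0289).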

Contribution

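It presents a method that directly optimizes a-DCF for spoofing-robust speaker verification, combining the a-DCF objective with BCE and a straightforward threshold optimization technique, and evaluates it on ASVspoof2019 data with both embedding fusion and non-linear score fusion.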

Findings

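01. 13% relative improvement in minimum a-DCF over a BCE-only system (embedding fusion)

02. 43% relative improvement in minimum a-DCF with non-linear score fusion

03. Minimum a-DCF reduced from 0.1445 to 0.1254 (embedding fusion) and from 0.0508 to 0.0289 (non-linear score fusion)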
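The relative improvements quoted in the findings can be checked against the minimum a-DCF values given in the abstract. The snippet below is a small arithmetic check only; the helper name `relative_improvement` is an illustrative choice, not something from the paper.

```python
def relative_improvement(baseline: float, improved: float) -> float:
    """Fractional reduction of a cost relative to the baseline (lower cost is better)."""
    return (baseline - improved) / baseline


# Minimum a-DCF pairs (before, after) as reported in the abstract.
reported = {
    "embedding fusion": (0.1445, 0.1254),
    "non-linear score fusion": (0.0508, 0.0289),
}

for system, (before, after) in reported.items():
    pct = 100.0 * relative_improvement(before, after)
    print(f"{system}: {pct:.1f}% relative improvement")
# embedding fusion: 13.2% relative improvement
# non-linear score fusion: 43.1% relative improvement
```

Both values round to the 13% and 43% figures stated on this page.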

Abstract

Automatic speaker verification (ASV) systems are vulnerable to spoofing attacks. We propose a spoofing-robust ASV system optimized directly for the recently introduced architecture-agnostic detection cost function (a-DCF), which allows targeting a desired trade-off between the contradicting aims of user convenience and robustness to spoofing. We combine a-DCF and binary cross-entropy (BCE) with a novel, straightforward threshold optimization technique. Our results with an embedding fusion system on ASVspoof2019 data demonstrate relative improvement of $13\%$ over a system trained using BCE only (from minimum a-DCF of $0.1445$ to $0.1254$). Using an alternative non-linear score fusion approach provides relative improvement of $43\%$ (from minimum a-DCF of $0.0508$ to $0.0289$).
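To make the "minimum a-DCF" figures more concrete, here is a minimal sketch of how such a quantity could be computed from a single set of detection scores. It assumes the commonly used form of a-DCF as a weighted sum of three error rates (target miss, non-target false acceptance, spoof false acceptance) evaluated at one shared threshold. The cost and prior values, the function names, and the toy scores are all placeholders chosen for illustration; they are not taken from this paper, and any normalisation or the threshold optimization technique used by the authors is not reproduced here.

```python
import numpy as np


def a_dcf_curve(tar, non, spf,
                c_miss=1.0, c_fa_non=10.0, c_fa_spf=10.0,      # assumed costs
                pi_tar=0.90, pi_non=0.05, pi_spf=0.05):        # assumed priors
    """Unnormalised a-DCF at every candidate threshold.

    A trial is accepted when its score is >= the threshold.
    Returns (thresholds, costs) as 1-D arrays of equal length.
    """
    tar = np.asarray(tar, dtype=float)
    non = np.asarray(non, dtype=float)
    spf = np.asarray(spf, dtype=float)

    # Candidate thresholds: every observed score, plus +inf (reject everything).
    thresholds = np.unique(np.concatenate([tar, non, spf, [np.inf]]))

    # Error rates per threshold, via broadcasting (thresholds x trials).
    p_miss = (tar[None, :] < thresholds[:, None]).mean(axis=1)
    p_fa_non = (non[None, :] >= thresholds[:, None]).mean(axis=1)
    p_fa_spf = (spf[None, :] >= thresholds[:, None]).mean(axis=1)

    costs = (c_miss * pi_tar * p_miss
             + c_fa_non * pi_non * p_fa_non
             + c_fa_spf * pi_spf * p_fa_spf)
    return thresholds, costs


def min_a_dcf(tar, non, spf, **kwargs):
    """Smallest a-DCF over all candidate thresholds, and the threshold achieving it."""
    thresholds, costs = a_dcf_curve(tar, non, spf, **kwargs)
    best = int(np.argmin(costs))
    return float(costs[best]), float(thresholds[best])


if __name__ == "__main__":
    # Toy scores drawn from overlapping Gaussians (purely synthetic).
    rng = np.random.default_rng(0)
    tar = rng.normal(2.0, 1.0, 500)    # target speaker, bona fide
    non = rng.normal(-1.0, 1.0, 500)   # non-target speaker, bona fide
    spf = rng.normal(-0.5, 1.0, 500)   # spoofed trials
    cost, thr = min_a_dcf(tar, non, spf)
    print(f"min a-DCF = {cost:.4f} at threshold {thr:.3f}")
```

Changing the cost and prior arguments shifts the threshold at which the minimum occurs, which is one way to see the trade-off between user convenience (few misses) and robustness to spoofing (few spoof acceptances) mentioned in the abstract.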

Taxonomy

Topics: Speech and Audio Processing · Advanced Data Compression Techniques · Speech Recognition and Synthesis