Two Methods for Spoofing-Aware Speaker Verification: Multi-Layer   Perceptron Score Fusion Model and Integrated Embedding Projector

Jungwoo Heo; Ju-ho Kim; Hyun-seo Shin

arXiv:2206.13807·eess.AS·June 14, 2023

Two Methods for Spoofing-Aware Speaker Verification: Multi-Layer Perceptron Score Fusion Model and Integrated Embedding Projector

Jungwoo Heo, Ju-ho Kim, Hyun-seo Shin

PDF

Open Access

TL;DR

This paper introduces two novel back-end systems, MSFM and IEP, that effectively combine speaker verification and spoofing countermeasures, significantly improving spoofing-aware speaker verification performance.

Contribution

The paper presents two new methods, MSFM and IEP, for integrating ASV and spoofing countermeasures, advancing spoofing-aware speaker verification techniques.

Findings

01

Achieved SASV EER of 0.56% and 1.32% on SASV 2022 challenge data.

02

Effectively integrated ASV and CM systems using proposed methods.

03

Demonstrated significant performance improvements over existing approaches.

Abstract

The use of deep neural networks (DNN) has dramatically elevated the performance of automatic speaker verification (ASV) over the last decade. However, ASV systems can be easily neutralized by spoofing attacks. Therefore, the Spoofing-Aware Speaker Verification (SASV) challenge is designed and held to promote development of systems that can perform ASV considering spoofing attacks by integrating ASV and spoofing countermeasure (CM) systems. In this paper, we propose two back-end systems: multi-layer perceptron score fusion model (MSFM) and integrated embedding projector (IEP). The MSFM, score fusion back-end system, derived SASV score utilizing ASV and CM scores and embeddings. On the other hand,IEP combines ASV and CM embeddings into SASV embedding and calculates final SASV score based on the cosine similarity. We effectively integrated ASV and CM systems through proposed MSFM and IEP…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Voice and Speech Disorders