Incorporating Pass-Phrase Dependent Background Models for Text-Dependent Speaker Verification

A. K. Sarkar; Zheng-Hua Tan

arXiv:1611.06423·cs.CL·March 29, 2021

TL;DR

This paper introduces pass-phrase dependent background models for text-dependent speaker verification, integrating pass-phrase identification into the verification process to improve accuracy, especially for non-target errors.

Contribution

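The paper proposes pass-phrase dependent background models (PBMs), each adapted from a text-independent background model using utterances of one pass-phrase, so that pass-phrase identification becomes part of text-dependent speaker verification (TD-SV) scoring and improves verification performance.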

Findings

01

Significantly reduces error rates for non-target trials.

02

Maintains comparable performance when imposters speak the correct pass-phrase.

03

Effective on short-utterance datasets.

Abstract

In this paper, we propose pass-phrase dependent background models (PBMs) for text-dependent (TD) speaker verification (SV) to integrate the pass-phrase identification process into the conventional TD-SV system, where a PBM is derived from a text-independent background model through adaptation using the utterances of a particular pass-phrase. During training, pass-phrase specific target speaker models are derived from the particular PBM using the training data for the respective target model. While testing, the best PBM is first selected for the test utterance in the maximum likelihood (ML) sense and the selected PBM is then used for the log likelihood ratio (LLR) calculation with respect to the claimant model. The proposed method incorporates the pass-phrase identification step in the LLR calculation, which is not considered in conventional standalone TD-SV systems. The performance of…
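The test-time procedure described in the abstract (pick the best PBM in the ML sense, then score the claimant against it) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the models here are toy diagonal-covariance GMMs with hand-set parameters, likelihoods are averaged per frame, and the names `gmm_avg_loglik`, `select_pbm`, and `llr_score` are hypothetical. The abstract does not specify the model family, the adaptation method, or the score normalization.

```python
import numpy as np

def gmm_avg_loglik(X, weights, means, variances):
    """Average per-frame log-likelihood of frames X (T, D) under a
    diagonal-covariance GMM with K components (an assumed model family)."""
    X = np.asarray(X, dtype=float)
    diff = X[:, None, :] - means[None, :, :]                  # (T, K, D)
    log_comp = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)
    )                                                         # (T, K)
    log_joint = log_comp + np.log(weights)                    # add mixture weights
    m = log_joint.max(axis=1, keepdims=True)                  # stable log-sum-exp
    log_frame = m[:, 0] + np.log(np.exp(log_joint - m).sum(axis=1))
    return float(log_frame.mean())

def select_pbm(X, pbms):
    """Return the pass-phrase whose PBM gives the highest likelihood for X."""
    scores = {phrase: gmm_avg_loglik(X, *params) for phrase, params in pbms.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]

def llr_score(X, claimant_model, pbms):
    """LLR of the claimant model against the ML-selected PBM."""
    best_phrase, pbm_ll = select_pbm(X, pbms)
    return gmm_avg_loglik(X, *claimant_model) - pbm_ll, best_phrase

# Toy single-component models in 2-D (all parameter values are made up).
one = np.array([1.0])
unit_var = np.ones((1, 2))
pbms = {
    "phrase_a": (one, np.array([[0.0, 0.0]]), unit_var),
    "phrase_b": (one, np.array([[5.0, 5.0]]), unit_var),
}
# Claimant model for phrase_a, imagined as adapted from the phrase_a PBM.
claimant = (one, np.array([[0.5, 0.5]]), unit_var)

X_ok = np.full((10, 2), 0.5)      # frames resembling the claimant on phrase_a
X_wrong = np.full((10, 2), 5.0)   # frames resembling phrase_b instead

llr_ok, best_ok = llr_score(X_ok, claimant, pbms)
llr_wrong, best_wrong = llr_score(X_wrong, claimant, pbms)
print(best_ok, llr_ok)
print(best_wrong, llr_wrong)
```

In this toy setup, a wrong pass-phrase causes a PBM that matches the spoken phrase well to be selected, which pushes the LLR down. That is the intuition behind folding pass-phrase identification into the score rather than comparing against a single text-independent background model.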

Taxonomy

Topics: Speech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing