Local Training for PLDA in Speaker Verification

Chenghui Zhao; Lantian Li; Dong Wang; April Pu

arXiv:1609.08433·cs.SD·September 28, 2016·2 cites

Local Training for PLDA in Speaker Verification

Chenghui Zhao, Lantian Li, Dong Wang, April Pu

PDF

Open Access

TL;DR

This paper introduces a local training method for PLDA in speaker verification that uses inexpensive local labels to improve performance when global labels are scarce.

Contribution

It proposes a novel local training approach that leverages inexpensive local labels for PLDA adaptation, reducing the need for costly global labels.

Findings

01

Significant performance improvements with limited global labels

02

Effective use of local labels for PLDA training

03

Enhanced speaker verification accuracy

Abstract

PLDA is a popular normalization approach for the i-vector model, and it has delivered state-of-the-art performance in speaker verification. However, PLDA training requires a large amount of labeled development data, which is highly expensive in most cases. A possible approach to mitigate the problem is various unsupervised adaptation methods, which use unlabeled data to adapt the PLDA scattering matrices to the target domain. In this paper, we present a new `local training' approach that utilizes inaccurate but much cheaper local labels to train the PLDA model. These local labels discriminate speakers within a single conversion only, and so are much easier to obtain compared to the normal `global labels'. Our experiments show that the proposed approach can deliver significant performance improvement, particularly with limited globally-labeled data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Natural Language Processing Techniques