An MAP Estimation for Between-Class Variance

Jiao Han; Yunqi Cai; Lantian Li; Guanyu Li; Dong Wang

arXiv:2111.12331·cs.SD·November 25, 2021

An MAP Estimation for Between-Class Variance

Jiao Han, Yunqi Cai, Lantian Li, Guanyu Li, Dong Wang

PDF

Open Access

TL;DR

This paper introduces a Maximum A Posteriori (MAP) estimation method for the between-class variance in probabilistic linear discriminant analysis (PLDA), improving generalization in open-set verification tasks by addressing limited class data issues.

Contribution

It proposes a novel MAP estimation approach using an Inverse-Wishart prior for the between-class variance in hierarchical models like PLDA, with a tractable inference method.

Findings

01

MAP estimation improves verification performance

02

Method enhances generalization in limited class scenarios

03

Effective in PLDA scoring and length normalization

Abstract

Probabilistic linear discriminant analysis (PLDA) has been widely used in open-set verification tasks, such as speaker verification. A potential issue of this model is that the training set often contains limited number of classes, which makes the estimation for the between-class variance unreliable. This unreliable estimation often leads to degraded generalization. In this paper, we present an MAP estimation for the between-class variance, by employing an Inverse-Wishart prior. A key problem is that with hierarchical models such as PLDA, the prior is placed on the variance of class means while the likelihood is based on class members, which makes the posterior inference intractable. We derive a simple MAP estimation for such a model, and test it in both PLDA scoring and length normalization. In both cases, the MAP-based estimation delivers interesting performance improvement.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing