An MAP Estimation for Between-Class Variance
Jiao Han, Yunqi Cai, Lantian Li, Guanyu Li, Dong Wang

TL;DR
This paper introduces a Maximum A Posteriori (MAP) estimation method for the between-class variance in probabilistic linear discriminant analysis (PLDA), improving generalization in open-set verification tasks by addressing limited class data issues.
Contribution
It proposes a novel MAP estimation approach using an Inverse-Wishart prior for the between-class variance in hierarchical models like PLDA, with a tractable inference method.
Findings
MAP estimation improves verification performance
Method enhances generalization in limited class scenarios
Effective in PLDA scoring and length normalization
Abstract
Probabilistic linear discriminant analysis (PLDA) has been widely used in open-set verification tasks, such as speaker verification. A potential issue of this model is that the training set often contains limited number of classes, which makes the estimation for the between-class variance unreliable. This unreliable estimation often leads to degraded generalization. In this paper, we present an MAP estimation for the between-class variance, by employing an Inverse-Wishart prior. A key problem is that with hierarchical models such as PLDA, the prior is placed on the variance of class means while the likelihood is based on class members, which makes the posterior inference intractable. We derive a simple MAP estimation for such a model, and test it in both PLDA scoring and length normalization. In both cases, the MAP-based estimation delivers interesting performance improvement.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing
