Joint PLDA for Simultaneous Modeling of Two Factors
Luciana Ferrer, Mitchell McLaren

TL;DR
This paper introduces a generalized joint PLDA model that simultaneously captures class and nuisance factors, improving biometric verification performance, especially in multilingual speaker recognition scenarios.
Contribution
It extends PLDA to jointly model class and nuisance factors without changing its core structure, enhancing biometric recognition accuracy.
Findings
Significant performance improvements in multilingual speaker verification.
Effective handling of nuisance conditions during training and testing.
Robustness when training data is predominantly monolingual.
Abstract
Probabilistic linear discriminant analysis (PLDA) is a method used for biometric problems like speaker or face recognition that models the variability of the samples using two latent variables, one that depends on the class of the sample and another one that is assumed independent across samples and models the within-class variability. In this work, we propose a generalization of PLDA that enables joint modeling of two sample-dependent factors: the class of interest and a nuisance condition. The approach does not change the basic form of PLDA but rather modifies the training procedure to consider the dependency across samples of the latent variable that models within-class variability. While the identity of the nuisance condition is needed during training, it is not needed during testing since we propose a scoring procedure that marginalizes over the corresponding latent variable. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing
