A Generalized Framework for Domain Adaptation of PLDA in Speaker Recognition
Qiongqiong Wang, Koji Okabe, Kong Aik Lee, Takafumi Koshinaka

TL;DR
This paper introduces a flexible domain adaptation framework for PLDA in speaker recognition, incorporating new correlation-alignment and covariance regularization techniques that improve robustness and performance across domains.
Contribution
It presents a generalized domain adaptation framework for PLDA, including novel correlation-alignment-based interpolation and covariance regularization methods.
Findings
Correlation-alignment-based interpolation reduces minCprimary by up to 30.5%.
Regularization enhances robustness to interpolation weight variations.
Performance surpasses traditional methods in domain adaptation scenarios.
Abstract
This paper proposes a generalized framework for domain adaptation of Probabilistic Linear Discriminant Analysis (PLDA) in speaker recognition. It not only includes several existing supervised and unsupervised domain adaptation methods but also makes possible more flexible usage of available data in different domains. In particular, we introduce here the two new techniques described below. (1) Correlation-alignment-based interpolation and (2) covariance regularization. The proposed correlation-alignment-based interpolation method decreases minCprimary up to 30.5% as compared with that from an out-of-domain PLDA model before adaptation, and minCprimary is also 5.5% lower than with a conventional linear interpolation method with optimal interpolation weights. Further, the proposed regularization technique ensures robustness in interpolations w.r.t. varying interpolation weights, which in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing
