A Study on Decoupled Probabilistic Linear Discriminant Analysis

Di Wang; Lantian Li; Hongzhi Yu; Dong Wang

arXiv:2111.12326·cs.SD·November 25, 2021·1 cites

A Study on Decoupled Probabilistic Linear Discriminant Analysis

Di Wang, Lantian Li, Hongzhi Yu, Dong Wang

PDF

Open Access

TL;DR

This paper introduces a decoupled approach to probabilistic linear discriminant analysis (PLDA) that combines global simplicity with local complexity, improving speaker verification performance on raw vectors but with limitations after normalization.

Contribution

It proposes a novel decoupling method for PLDA that separates global and local modeling, enhancing flexibility while maintaining generalization.

Findings

01

Decoupled PLDA outperforms vanilla PLDA on raw speaker vectors.

02

The advantage diminishes after length normalization of vectors.

03

Future work suggested on non-linear local models.

Abstract

Probabilistic linear discriminant analysis (PLDA) has broad application in open-set verification tasks, such as speaker verification. A key concern for PLDA is that the model is too simple (linear Gaussian) to deal with complicated data; however, the simplicity by itself is a major advantage of PLDA, as it leads to desirable generalization. An interesting research therefore is how to improve modeling capacity of PLDA while retaining the simplicity. This paper presents a decoupling approach, which involves a global model that is simple and generalizable, and a local model that is complex and expressive. While the global model holds a bird view on the entire data, the local model represents the details of individual classes. We conduct a preliminary study towards this direction and investigate a simple decoupling model including both the global and local models. The new model, which we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Natural Language Processing Techniques