Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA   model

Niko Brummer; Anna Silnova; Lukas Burget; Themos Stafylakis

arXiv:1802.09777·stat.ML·February 28, 2018

Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model

Niko Brummer, Anna Silnova, Lukas Burget, Themos Stafylakis

PDF

TL;DR

This paper introduces Gaussian meta-embeddings derived from heavy-tailed PLDA models, enabling uncertainty propagation and improving speaker recognition accuracy over traditional GPLDA methods, especially on i-vectors without length normalization.

Contribution

It generalizes Gaussian PLDA to heavy-tailed PLDA for Gaussian meta-embeddings with variable precisions, enhancing uncertainty propagation and recognition performance.

Findings

01

Up to 20% accuracy improvement on NIST SRE datasets.

02

GMEs with variable precisions propagate uncertainty effectively.

03

Method outperforms GPLDA on unnormalized i-vectors.

Abstract

Embeddings in machine learning are low-dimensional representations of complex input patterns, with the property that simple geometric operations like Euclidean distances and dot products can be used for classification and comparison tasks. The proposed meta-embeddings are special embeddings that live in more general inner product spaces. They are designed to propagate uncertainty to the final output in speaker recognition and similar applications. The familiar Gaussian PLDA model (GPLDA) can be re-formulated as an extractor for Gaussian meta-embeddings (GMEs), such that likelihood ratio scores are given by Hilbert space inner products between Gaussian likelihood functions. GMEs extracted by the GPLDA model have fixed precisions and do not propagate uncertainty. We show that a generalization to heavy-tailed PLDA gives GMEs with variable precisions, which do propagate uncertainty.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.