A Speaker Verification Backend with Robust Performance across Conditions

Luciana Ferrer; Mitchell McLaren; Niko Brummer

arXiv:2102.01760·cs.SD·August 18, 2021

TL;DR

This paper introduces an adaptive, discriminatively trained backend for speaker verification. It uses duration and other automatically extracted side-information to improve calibration and discrimination across diverse and unseen conditions.

Contribution

The paper proposes an adaptive calibrator that is integrated with PLDA in a single backend and trained discriminatively, making the system more robust across varied conditions.

Findings

01 Significant calibration improvements on diverse datasets

02 Consistent discrimination performance enhancement

03 Joint training of PLDA and calibrator is essential

Abstract

In this paper, we address the problem of speaker verification in conditions unseen or unknown during development. A standard method for speaker verification consists of extracting speaker embeddings with a deep neural network and processing them through a backend composed of probabilistic linear discriminant analysis (PLDA) and global logistic regression score calibration. This method is known to result in systems that work poorly on conditions different from those used to train the calibration model. We propose to modify the standard backend, introducing an adaptive calibrator that uses duration and other automatically extracted side-information to adapt to the conditions of the inputs. The backend is trained discriminatively to optimize binary cross-entropy. When trained on a number of diverse datasets that are labeled only with respect to speaker, the proposed backend consistently…
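To make the contrast in the abstract concrete, the following is a minimal sketch of a global affine calibrator next to a condition-dependent one, together with a binary cross-entropy objective. It is an illustration under stated assumptions, not the paper's implementation: the function names, the affine dependence of the scale and offset on the side-information vector, the unweighted average in the loss, and the toy numbers are all hypothetical choices made for this example.

```python
import numpy as np


def global_calibration(scores, alpha, beta):
    """Global calibration: a single scale and offset shared by all trials."""
    return alpha * scores + beta


def adaptive_calibration(scores, side_info, w_alpha, b_alpha, w_beta, b_beta):
    """Condition-dependent calibration (hypothetical form).

    The scale and offset are affine functions of a per-trial side-information
    vector (for example duration-derived features), so each trial receives
    its own calibration parameters.
    """
    alpha = side_info @ w_alpha + b_alpha  # per-trial scale, shape (n_trials,)
    beta = side_info @ w_beta + b_beta     # per-trial offset, shape (n_trials,)
    return alpha * scores + beta


def binary_cross_entropy(llrs, labels, prior=0.5):
    """Mean binary cross-entropy of posteriors implied by the LLRs.

    labels are 1 for same-speaker trials and 0 for different-speaker trials.
    The plain mean over trials is an assumption of this sketch.
    """
    logits = llrs + np.log(prior / (1.0 - prior))
    # softplus(-logit) for targets, softplus(logit) for non-targets
    losses = np.where(labels == 1,
                      np.logaddexp(0.0, -logits),
                      np.logaddexp(0.0, logits))
    return losses.mean()


# Toy data: three trials with raw scores and two side-information features each.
scores = np.array([2.0, -1.0, 0.5])
labels = np.array([1, 0, 1])
side_info = np.array([[0.1, 1.0],
                      [0.5, 2.0],
                      [0.9, 3.0]])

global_llrs = global_calibration(scores, alpha=1.5, beta=-0.2)
adaptive_llrs = adaptive_calibration(
    scores, side_info,
    w_alpha=np.array([0.3, -0.1]), b_alpha=1.5,
    w_beta=np.array([-0.2, 0.05]), b_beta=-0.2,
)
print(binary_cross_entropy(global_llrs, labels))
print(binary_cross_entropy(adaptive_llrs, labels))
```

With the side-information weights set to zero, the adaptive form reduces to the global calibrator, which is one way to see it as a generalization of the standard backend. In a discriminative setup, the parameters would be fitted by minimizing the cross-entropy over training trials rather than set by hand as they are here.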

Code & Models

Repositories

luferrer/DCA-PLDA
PyTorch (official)

Taxonomy

Methods: Logistic Regression