A Comparison of Differential Performance Metrics for the Evaluation of   Automatic Speaker Verification Fairness

Oubaida Chouchane; Christoph Busch; Chiara Galdi; Nicholas Evans,; Massimiliano Todisco

arXiv:2404.17810·eess.AS·April 30, 2024

A Comparison of Differential Performance Metrics for the Evaluation of Automatic Speaker Verification Fairness

Oubaida Chouchane, Christoph Busch, Chiara Galdi, Nicholas Evans,, Massimiliano Todisco

PDF

Open Access

TL;DR

This paper compares three fairness metrics for automatic speaker verification, finding that GARBE uniquely satisfies key fairness criteria and highlighting the complex trade-offs between fairness and accuracy in system performance.

Contribution

It introduces a comparison of fairness metrics in ASV, extending prior face recognition work, and evaluates five state-of-the-art systems for fairness and accuracy.

Findings

01

GARBE is the only metric meeting all fairness criteria

02

A nuanced trade-off exists between fairness and verification accuracy

03

Evaluation of five ASV systems reveals complex fairness-accuracy interplay

Abstract

When decisions are made and when personal data is treated by automated processes, there is an expectation of fairness -- that members of different demographic groups receive equitable treatment. This expectation applies to biometric systems such as automatic speaker verification (ASV). We present a comparison of three candidate fairness metrics and extend previous work performed for face recognition, by examining differential performance across a range of different ASV operating points. Results show that the Gini Aggregation Rate for Biometric Equitability (GARBE) is the only one which meets three functional fairness measure criteria. Furthermore, a comprehensive evaluation of the fairness and verification performance of five state-of-the-art ASV systems is also presented. Our findings reveal a nuanced trade-off between fairness and verification accuracy underscoring the complex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis