Study on the Fairness of Speaker Verification Systems on Underrepresented Accents in English
Mariel Estevez, Luciana Ferrer

TL;DR
This study evaluates the fairness of speaker verification systems across different English accents, revealing biases related to underrepresented accents and demonstrating that data balancing can improve fairness.
Contribution
The paper introduces a new accent-diverse dataset and shows that simple data balancing techniques can mitigate accent bias in speaker verification systems.
Findings
Calibration performance degrades on underrepresented accents
Data balancing improves system fairness
Bias mitigation is effective with the discriminative condition-aware backend
Abstract
Speaker verification (SV) systems are currently being used to make sensitive decisions like giving access to bank accounts or deciding whether the voice of a suspect coincides with that of the perpetrator of a crime. Ensuring that these systems are fair and do not disfavor any particular group is crucial. In this work, we analyze the performance of several state-of-the-art SV systems across groups defined by the accent of the speakers when speaking English. To this end, we curated a new dataset based on the VoxCeleb corpus where we carefully selected samples from speakers with accents from different countries. We use this dataset to evaluate system performance for several SV systems trained with VoxCeleb data. We show that, while discrimination performance is reasonably robust across accent groups, calibration performance degrades dramatically on some accents that are not well…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Authorship Attribution and Profiling
