Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities
Pranav Dheram, Murugesan Ramakrishnan, Anirudh Raju, I-Fan Chen, Brian, King, Katherine Powell, Melissa Saboowala, Karan Shetty, Andreas Stolcke

TL;DR
This paper investigates performance disparities in speech recognition systems across different user groups, proposing methods to discover and mitigate these disparities to improve fairness without sacrificing accuracy.
Contribution
It introduces scalable cohort discovery methods using speaker embeddings and evaluates mitigation strategies like oversampling and additional input variables.
Findings
Speaker embeddings effectively identify diverse cohorts.
Oversampling reduces performance gaps.
Fairness mitigation does not harm overall accuracy.
Abstract
As for other forms of AI, speech recognition has recently been examined with respect to performance disparities across different user cohorts. One approach to achieve fairness in speech recognition is to (1) identify speaker cohorts that suffer from subpar performance and (2) apply fairness mitigation measures targeting the cohorts discovered. In this paper, we report on initial findings with both discovery and mitigation of performance disparities using data from a product-scale AI assistant speech recognition system. We compare cohort discovery based on geographic and demographic information to a more scalable method that groups speakers without human labels, using speaker embedding technology. For fairness mitigation, we find that oversampling of underrepresented cohorts, as well as modeling speaker cohort membership by additional input variables, reduces the gap between top- and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · AI and HR Technologies · Computational and Text Analysis Methods
