ASR-FAIRBENCH: Measuring and Benchmarking Equity Across Speech Recognition Systems

Anand Rai; Satyam Rahangdale; Utkarsh Anand; Animesh Mukherjee

arXiv:2505.11572·cs.SD·May 20, 2025

ASR-FAIRBENCH: Measuring and Benchmarking Equity Across Speech Recognition Systems

Anand Rai, Satyam Rahangdale, Utkarsh Anand, Animesh Mukherjee

PDF

Open Access

TL;DR

This paper introduces ASR-FAIRBENCH, a comprehensive benchmarking framework for evaluating both accuracy and fairness of speech recognition systems across diverse demographic groups, highlighting disparities and guiding more inclusive development.

Contribution

We present the ASR-FAIRBENCH leaderboard and a novel fairness score based on demographic data, enabling real-time assessment of ASR systems' equity and accuracy.

Findings

01

Significant performance disparities across demographic groups in SOTA ASR models

02

The FAIRBENCH framework effectively measures both accuracy and fairness

03

Benchmark results highlight the need for more inclusive ASR development

Abstract

Automatic Speech Recognition (ASR) systems have become ubiquitous in everyday applications, yet significant disparities in performance across diverse demographic groups persist. In this work, we introduce the ASR-FAIRBENCH leaderboard which is designed to assess both the accuracy and equity of ASR models in real-time. Leveraging the Meta's Fair-Speech dataset, which captures diverse demographic characteristics, we employ a mixed-effects Poisson regression model to derive an overall fairness score. This score is integrated with traditional metrics like Word Error Rate (WER) to compute the Fairness Adjusted ASR Score (FAAS), providing a comprehensive evaluation framework. Our approach reveals significant performance disparities in SOTA ASR models across demographic groups and offers a benchmark to drive the development of more inclusive ASR technologies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Face recognition and analysis