Toward Fair Speech Technologies: A Comprehensive Survey of Bias and Fairness in Speech AI

Yi-Cheng Lin; Yun-Shao Tsai; Kuan-Yu Chen; Hsiao-Ying Huang; Huang-Cheng Chou; Hung-yi Lee

arXiv:2605.01597·eess.AS·May 5, 2026

Toward Fair Speech Technologies: A Comprehensive Survey of Bias and Fairness in Speech AI

Yi-Cheng Lin, Yun-Shao Tsai, Kuan-Yu Chen, Hsiao-Ying Huang, Huang-Cheng Chou, Hung-yi Lee

PDF

TL;DR

This survey comprehensively reviews bias and fairness issues in speech AI, proposing a unified framework, formal definitions, evaluation metrics, and mitigation strategies tailored to speech-specific challenges.

Contribution

It introduces a formalized set of fairness definitions for speech, links evaluation metrics to these definitions, and systematically categorizes bias sources and mitigation methods in speech AI.

Findings

01

Identifies speech-specific bias mechanisms like channel bias and annotation subjectivity.

02

Provides a decision tree for selecting appropriate fairness evaluation metrics.

03

Maps mitigation strategies to specific bias sources in speech processing.

Abstract

Speech technologies are deployed in high-stakes settings, yet fairness concerns remain fragmented across tasks and disciplines. Existing surveys either adopt a general machine-learning perspective that overlooks speech-specific properties or focus on a single task, missing failure patterns shared across the speech domain. Synthesizing over 400 studies spanning generation and perception tasks and emerging speech-language models, this survey presents a unified framework that links formal fairness definitions to evaluation, diagnosis, and mitigation. We formalize seven fairness definitions adapted to the speech modality and organize the field's conceptual evolution through three paradigms: Robustness, Representation, and Governance. We then ground evaluation metrics in the mathematical cores of these definitions and offer a decision tree for metric selection. We diagnose bias sources along…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.