Fairness of Automatic Speech Recognition: Looking Through a Philosophical Lens
Anna Seo Gyeong Choi, Hoon Choi

TL;DR
This paper critically examines the fairness of Automatic Speech Recognition systems through a philosophical perspective, highlighting ethical issues related to linguistic diversity, bias, and social justice.
Contribution
It introduces a philosophical framework to understand ASR bias, emphasizing ethical dimensions and proposing that addressing bias requires recognizing diverse speech varieties as legitimate.
Findings
ASR bias involves ethical dimensions like temporal taxation and cultural identity.
Current fairness metrics fail to capture power asymmetries caused by ASR misrecognition.
Addressing bias requires recognition of linguistic diversity beyond technical fixes.
Abstract
Automatic Speech Recognition (ASR) systems now mediate countless human-technology interactions, yet research on their fairness implications remains surprisingly limited. This paper examines ASR bias through a philosophical lens, arguing that systematic misrecognition of certain speech varieties constitutes more than a technical limitation -- it represents a form of disrespect that compounds historical injustices against marginalized linguistic communities. We distinguish between morally neutral classification (discriminate1) and harmful discrimination (discriminate2), demonstrating how ASR systems can inadvertently transform the former into the latter when they consistently misrecognize non-standard dialects. We identify three unique ethical dimensions of speech technologies that differentiate ASR bias from other algorithmic fairness concerns: the temporal burden placed on speakers of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhonetics and Phonology Research · Speech Recognition and Synthesis · ICT in Developing Communities
