A Sociolinguistic Analysis of Automatic Speech Recognition Bias in Newcastle English
Dana Serditova, Kevin Tang

TL;DR
This study analyzes how dialectal and social variations in Newcastle English influence ASR errors, revealing that phonological features and social factors significantly affect recognition accuracy, highlighting the need for sociolinguistic considerations in ASR development.
Contribution
It provides a sociolinguistic framework for understanding ASR bias in regional dialects, emphasizing the importance of community-based data and dialect-specific features for improving speech technology.
Findings
Phonological variation explains most recognition errors.
Error rates are higher for men and extreme age groups.
Dialect-specific features like vowel quality cause recurrent failures.
Abstract
Automatic Speech Recognition (ASR) systems are widely used in everyday communication, education, healthcare, and industry, yet their performance remains uneven across speakers, particularly when dialectal variation diverges from the mainstream accents represented in training data. This study investigates ASR bias through a sociolinguistic analysis of Newcastle English, a regional variety of North-East England that has been shown to challenge current speech recognition technologies. Using spontaneous speech from the Diachronic Electronic Corpus of Tyneside English (DECTE), we evaluate the output of a state-of-the-art commercial ASR system and conduct a fine-grained analysis of more than 3,000 transcription errors. Errors are classified by linguistic domain and examined in relation to social variables including gender, age, and socioeconomic status. In addition, an acoustic case study of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLinguistic Variation and Morphology · Phonetics and Phonology Research · Speech Recognition and Synthesis
