Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models
Hanin Atwany, Abdul Waheed, Rita Singh, Monojit Choudhury, Bhiksha Raj

TL;DR
This paper investigates hallucination in speech recognition models, revealing how distribution shifts and noise impact hallucination rates, and proposes HER as a new metric to better evaluate transcription reliability in critical applications.
Contribution
It introduces the hallucination error rate (HER) metric and analyzes how model size, architecture, and distribution shifts influence hallucination in ASR systems.
Findings
HER increases with distribution shift ($\alpha=0.91$)
High WER can hide hallucinations, low WER can conceal dangerous errors
Synthetic noise elevates HER significantly
Abstract
Speech foundation models trained at a massive scale, both in terms of model and data size, result in robust systems capable of performing multiple speech tasks, including automatic speech recognition (ASR). These models transcend language and domain barriers, yet effectively measuring their performance remains a challenge. Traditional metrics like word error rate (WER) and character error rate (CER) are commonly used to evaluate ASR performance but often fail to reflect transcription quality in critical contexts, particularly when detecting fabricated outputs. This phenomenon, known as hallucination, is especially concerning in high-stakes domains such as healthcare, legal, and aviation, where errors can have severe consequences. In our work, we address this gap by investigating hallucination in ASR models. We examine how factors such as distribution shifts, model size, and model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFractal and DNA sequence analysis · Cognitive Science and Education Research
