Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors
Vishwanath Pratap Singh, Md. Sahidullah, Tomi Kinnunen

TL;DR
This study systematically analyzes the causes of ASR errors in children's speech, identifying key physiological, cognitive, and external factors affecting model accuracy through causal inference and benchmarking of various speech foundation models.
Contribution
It provides the first comprehensive causal analysis of factors impacting children's ASR performance, comparing multiple models and identifying primary contributors to errors.
Findings
Age and number of words in audio have the highest impact on accuracy.
Fine-tuning reduces sensitivity to physiological and cognitive factors.
Background noise and pronunciation ability also significantly affect performance.
Abstract
The increasing use of children's automatic speech recognition (ASR) systems has spurred research efforts to improve the accuracy of models designed for children's speech in recent years. The current approach utilizes either open-source speech foundation models (SFMs) directly or fine-tuning them with children's speech data. These SFMs, whether open-source or fine-tuned for children, often exhibit higher word error rates (WERs) compared to adult speech. However, there is a lack of systemic analysis of the cause of this degraded performance of SFMs. Understanding and addressing the reasons behind this performance disparity is crucial for improving the accuracy of SFMs for children's speech. Our study addresses this gap by investigating the causes of accuracy degradation and the primary contributors to WER in children's speech. In the first part of the study, we conduct a comprehensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCardiac Arrest and Resuscitation · Risk and Safety Analysis
MethodsCausal inference
