Evaluating the Accuracy of Automatic Speech Recognition Systems in Home Healthcare Settings
Dayoung Yu, Sasha Vergez, Grace Flaherty, Maryam Zolnoori, Nicole Onorato, Julia Hirschberg, Maxim Topaz, Margaret McDonald

TL;DR
This study evaluates how accurately speech recognition software transcribes conversations in home healthcare settings, finding that accuracy varies with conversation type and length.
Contribution
The study provides new insights into ASR accuracy in home healthcare settings, highlighting differences in performance based on utterance length and speaker type.
Findings
AWS-GT had a higher word error rate in clinician recordings (.26) compared to RA recordings (.19).
Short utterances (4-8 words) had the highest word error rate (.39).
Clinician recordings contained more short utterances than RA recordings (35% vs. 25%).
Abstract
Recent developments in automatic speech recognition (ASR) systems have resulted in increased use of automated transcription services to analyze verbal communications. We aimed to analyze the accuracy of AWS General Transcribe (AWS-GT) on audio recordings of home healthcare patients. These audio recordings included in-person clinician visits and phone calls with research assistants (RAs). Study staff rated the clarity of the audio using a scale of low, medium and high. The audio quality was similar among the two types of recordings. Our results are a part of a larger study aiming to use automated speech processing to identify risk factors for hospitalization and emergency department visits. Overall, 4002 utterances, defined as uninterrupted blocks of speech with four or more words, were analyzed— 3,472 (87%) from clinician recordings and 520 (13%) from RA recordings. Word error rate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiology practices and education · Voice and Speech Disorders · Electronic Health Records Systems
