Application of Whisper in Clinical Practice: the Post-Stroke Speech Assessment during a Naming Task
Milena Davudova, Ziyuan Cai, Valentina Giunchiglia, Dragos C. Gruia, Giulia Sanguedolce, Adam Hampshire, Fatemeh Geranmayeh

TL;DR
This study evaluates Whisper, a state-of-the-art ASR model, for transcribing and analyzing speech in post-stroke patients during a naming task, highlighting the importance of fine-tuning for clinical applications.
Contribution
The paper demonstrates that fine-tuning Whisper significantly improves transcription accuracy and supports language function prediction in stroke patients, addressing challenges in clinical speech assessment.
Findings
Fine-tuning reduces Word Error Rate by over 87% in healthy speech.
Learned representations enable accurate prediction of speech quality.
Generalizability to unseen clinical speech datasets is limited.
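The findings above are reported as Word Error Rate (WER), and a small sketch may help make that metric concrete. It assumes the standard definition: word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function names `word_error_rate` and `relative_reduction`, the lowercase-and-split normalisation, and the example values are illustrative assumptions, not the authors' evaluation pipeline.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: words are compared after lowercasing and whitespace
    splitting; the paper's own text normalisation is not known here.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline_wer: float, finetuned_wer: float) -> float:
    """Fraction by which WER drops relative to the baseline."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - finetuned_wer) / baseline_wer


if __name__ == "__main__":
    # A single-word target with one extra word in the transcript:
    # one insertion over one reference word.
    print(word_error_rate("elephant", "an elephant"))
    # Made-up WER values, chosen only to show the arithmetic.
    print(relative_reduction(1.0, 0.125))
```

In a single-word naming task the reference contains one word, so any inserted, deleted, or substituted word moves WER by a full 100 percentage points, which is one reason short utterances are a harsh test for an ASR model.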
Abstract
Detailed assessment of language impairment following stroke remains a cognitively complex and clinician-intensive task, limiting timely and scalable diagnosis. Automatic Speech Recognition (ASR) foundation models offer a promising pathway to augment human evaluation through intelligent systems, but their effectiveness in the context of speech and language impairment remains uncertain. In this study, we evaluate whether Whisper, a state-of-the-art ASR foundation model, can be applied to transcribe and analyze speech from patients with stroke during a commonly used picture-naming task. We assess both verbatim transcription accuracy and the model's ability to support downstream prediction of language function, which has major implications for outcomes after stroke. Our results show that the baseline Whisper model performs poorly on single-word speech utterances. Nevertheless, fine-tuning…
Taxonomy
Topics: Interpreting and Communication in Healthcare · Educational Reforms and Innovations · Translation Studies and Practices
