Influence of ASR and Language Model on Alzheimer's Disease Detection
Joan Codina-Filb\`a, Guillermo C\'ambara, Jordi Luque, Mireia, Farr\'us

TL;DR
This paper investigates how automatic speech recognition and language models affect Alzheimer's detection accuracy from speech, highlighting the potential of acoustic features and the impact of transcription quality on diagnostic performance.
Contribution
It analyzes the influence of state-of-the-art ASR systems and language models on Alzheimer's detection, proposing a combined acoustic and lexical feature approach for improved accuracy.
Findings
Automatic transcripts without language models achieved 76.06% accuracy.
Using language models reduced accuracy by about 3%.
Acoustic features contribute significantly to detection performance.
Abstract
Alzheimer's Disease is the most common form of dementia. Automatic detection from speech could help to identify symptoms at early stages, so that preventive actions can be carried out. This research is a contribution to the ADReSSo Challenge, we analyze the usage of a SotA ASR system to transcribe participant's spoken descriptions from a picture. We analyse the loss of performance regarding the use of human transcriptions (measured using transcriptions from the 2020 ADReSS Challenge). Furthermore, we study the influence of a language model -- which tends to correct non-standard sequences of words -- with the lack of language model to decode the hypothesis from the ASR. This aims at studying the language bias and get more meaningful transcriptions based only on the acoustic information from patients. The proposed system combines acoustic -- based on prosody and voice quality -- and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Voice and Speech Disorders · Music and Audio Processing
