Influence of ASR and Language Model on Alzheimer's Disease Detection

Joan Codina-Filb\`a; Guillermo C\'ambara; Jordi Luque; Mireia; Farr\'us

arXiv:2110.15704·cs.CL·November 1, 2021

Influence of ASR and Language Model on Alzheimer's Disease Detection

Joan Codina-Filb\`a, Guillermo C\'ambara, Jordi Luque, Mireia, Farr\'us

PDF

Open Access

TL;DR

This paper investigates how automatic speech recognition and language models affect Alzheimer's detection accuracy from speech, highlighting the potential of acoustic features and the impact of transcription quality on diagnostic performance.

Contribution

It analyzes the influence of state-of-the-art ASR systems and language models on Alzheimer's detection, proposing a combined acoustic and lexical feature approach for improved accuracy.

Findings

01

Automatic transcripts without language models achieved 76.06% accuracy.

02

Using language models reduced accuracy by about 3%.

03

Acoustic features contribute significantly to detection performance.

Abstract

Alzheimer's Disease is the most common form of dementia. Automatic detection from speech could help to identify symptoms at early stages, so that preventive actions can be carried out. This research is a contribution to the ADReSSo Challenge, we analyze the usage of a SotA ASR system to transcribe participant's spoken descriptions from a picture. We analyse the loss of performance regarding the use of human transcriptions (measured using transcriptions from the 2020 ADReSS Challenge). Furthermore, we study the influence of a language model -- which tends to correct non-standard sequences of words -- with the lack of language model to decode the hypothesis from the ASR. This aims at studying the language bias and get more meaningful transcriptions based only on the acoustic information from patients. The proposed system combines acoustic -- based on prosody and voice quality -- and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Voice and Speech Disorders · Music and Audio Processing