Combining Prosodic, Voice Quality and Lexical Features to Automatically Detect Alzheimer's Disease
Mireia Farr\'us, Joan Codina-Filb\`a

TL;DR
This study explores combining prosodic, voice quality, and lexical features from spontaneous speech to improve automatic detection of Alzheimer's Disease, achieving high classification accuracy and low regression error.
Contribution
It introduces a multi-feature approach for AD detection using speech and lexical data, advancing non-intrusive diagnostic methods.
Findings
87.5% classification accuracy with Random Forest
4.54 RMSE in MMSE score regression
Effective use of prosody, voice quality, and lexical features
Abstract
Alzheimer's Disease (AD) is nowadays the most common form of dementia, and its automatic detection can help to identify symptoms at early stages, so that preventive actions can be carried out. Moreover, non-intrusive techniques based on spoken data are crucial for the development of AD automatic detection systems. In this light, this paper is presented as a contribution to the ADReSS Challenge, aiming at improving AD automatic detection from spontaneous speech. To this end, recordings from 108 participants, which are age-, gender-, and AD condition-balanced, have been used as training set to perform two different tasks: classification into AD/non-AD conditions, and regression over the Mini-Mental State Examination (MMSE) scores. Both tasks have been performed extracting 28 features from speech -- based on prosody and voice quality -- and 51 features from the transcriptions -- based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Voice and Speech Disorders · Phonetics and Phonology Research
MethodsLinear Regression
