Alzheimer's Dementia Detection from Audio and Text Modalities
Edward L. Campbell (1), Laura Doc\'io-Fern\'andez (1), Javier, Jim\'enez Raboso (2), Carmen Garc\'ia-Mateo (1) ((1) GTM research group,, AtlanTTic Research Center, University of Vigo, (2) acceXible)

TL;DR
This paper explores multimodal approaches combining audio and text analysis to improve automatic detection of Alzheimer's dementia, demonstrating promising results that outperform baseline systems.
Contribution
It introduces and compares six systems utilizing both speech and transcription features, including novel rhythmic and language modeling features, for Alzheimer's detection.
Findings
Multimodal systems outperform single modality approaches.
Rhythmic features improve detection of speech fluency issues.
Fusion strategies enhance overall detection performance.
Abstract
Automatic detection of Alzheimer's dementia by speech processing is enhanced when features of both the acoustic waveform and the content are extracted. Audio and text transcription have been widely used in health-related tasks, as spectral and prosodic speech features, as well as semantic and linguistic content, convey information about various diseases. Hence, this paper describes the joint work of the GTM-UVIGO research group and acceXible startup to the ADDReSS challenge at INTERSPEECH 2020. The submitted systems aim to detect patterns of Alzheimer's disease from both the patient's voice and message transcription. Six different systems have been built and compared: four of them are speech-based and the other two systems are text-based. The x-vector, i-vector, and statistical speech-based functionals features are evaluated. As a lower speaking fluency is a common pattern in patients…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Music and Audio Processing · Voice and Speech Disorders
