Alzheimer's Dementia Detection from Audio and Text Modalities

Edward L. Campbell (1); Laura Doc\'io-Fern\'andez (1); Javier; Jim\'enez Raboso (2); Carmen Garc\'ia-Mateo (1) ((1) GTM research group,; AtlanTTic Research Center; University of Vigo; (2) acceXible)

arXiv:2008.04617·eess.AS·August 12, 2020·1 cites

Alzheimer's Dementia Detection from Audio and Text Modalities

Edward L. Campbell (1), Laura Doc\'io-Fern\'andez (1), Javier, Jim\'enez Raboso (2), Carmen Garc\'ia-Mateo (1) ((1) GTM research group,, AtlanTTic Research Center, University of Vigo, (2) acceXible)

PDF

Open Access

TL;DR

This paper explores multimodal approaches combining audio and text analysis to improve automatic detection of Alzheimer's dementia, demonstrating promising results that outperform baseline systems.

Contribution

It introduces and compares six systems utilizing both speech and transcription features, including novel rhythmic and language modeling features, for Alzheimer's detection.

Findings

01

Multimodal systems outperform single modality approaches.

02

Rhythmic features improve detection of speech fluency issues.

03

Fusion strategies enhance overall detection performance.

Abstract

Automatic detection of Alzheimer's dementia by speech processing is enhanced when features of both the acoustic waveform and the content are extracted. Audio and text transcription have been widely used in health-related tasks, as spectral and prosodic speech features, as well as semantic and linguistic content, convey information about various diseases. Hence, this paper describes the joint work of the GTM-UVIGO research group and acceXible startup to the ADDReSS challenge at INTERSPEECH 2020. The submitted systems aim to detect patterns of Alzheimer's disease from both the patient's voice and message transcription. Six different systems have been built and compared: four of them are speech-based and the other two systems are text-based. The x-vector, i-vector, and statistical speech-based functionals features are evaluated. As a lower speaking fluency is a common pattern in patients…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Music and Audio Processing · Voice and Speech Disorders