CogniVoice: Multimodal and Multilingual Fusion Networks for Mild   Cognitive Impairment Assessment from Spontaneous Speech

Jiali Cheng; Mohamed Elgaar; Nidhi Vakil; Hadi Amiri

arXiv:2407.13660·cs.LG·July 19, 2024

CogniVoice: Multimodal and Multilingual Fusion Networks for Mild Cognitive Impairment Assessment from Spontaneous Speech

Jiali Cheng, Mohamed Elgaar, Nidhi Vakil, Hadi Amiri

PDF

Open Access 1 Repo

TL;DR

CogniVoice is a novel multimodal and multilingual framework that effectively detects Mild Cognitive Impairment and estimates MMSE scores from speech data, outperforming baseline models across languages.

Contribution

Introduces CogniVoice, a new ensemble multimodal and multilingual network based on 'Product of Experts' for MCI detection and MMSE estimation from speech.

Findings

01

Outperforms baseline models in MCI classification and MMSE regression.

02

Reduces performance gap across different languages.

03

Achieves 2.8 F1 points improvement in classification.

Abstract

Mild Cognitive Impairment (MCI) is a medical condition characterized by noticeable declines in memory and cognitive abilities, potentially affecting individual's daily activities. In this paper, we introduce CogniVoice, a novel multilingual and multimodal framework to detect MCI and estimate Mini-Mental State Examination (MMSE) scores by analyzing speech data and its textual transcriptions. The key component of CogniVoice is an ensemble multimodal and multilingual network based on ``Product of Experts'' that mitigates reliance on shortcut solutions. Using a comprehensive dataset containing both English and Chinese languages from TAUKADIAL challenge, CogniVoice outperforms the best performing baseline model on MCI classification and MMSE regression tasks by 2.8 and 4.1 points in F1 and RMSE respectively, and can effectively reduce the performance gap across different language groups by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CLU-UML/CogniVoice
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeurobiology of Language and Bilingualism