Machine Learning Classification Model Performance in Detecting Cognitive Impairments From Multimodal Embeddings
W Quin Yow, Adharsha Sam Edwin Sam Devahi, Lihan Zuo, Ka Lon Sou

TL;DR
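This study examines how combining acoustic and linguistic speech features affects machine learning models for detecting cognitive impairment. Acoustic features alone gave the highest validation accuracy, though the differences between feature sets were not statistically significant.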
Contribution
The paper compares SVM model performance using acoustic, linguistic, and combined embeddings for cognitive impairment detection.
Findings
Acoustic embeddings alone achieved the highest validation accuracy (72%) among the acoustic, linguistic, and combined feature sets.
Hyperparameter tuning did not prevent overfitting, as training accuracy was significantly higher than validation accuracy.
Combined acoustic and linguistic embeddings did not improve model performance over acoustic embeddings alone.
Abstract
In the fast-evolving field of AI for healthcare, there is a growing trend of combining acoustic and linguistic features from speech data to improve ML model performance for cognitive impairment detection. This paper evaluates a hyperparameter-tuned SVM model trained on three feature sets derived from pretrained Transformer-based models: acoustic CrisperWhisper embeddings, linguistic BERT embeddings, and both. Using 150 data points (94F, ages 51-99) equally distributed across healthy, MCI, and dementia classes, sourced from DementiaBank and challenge datasets from the University of Edinburgh and Carnegie Mellon University, no statistically significant differences (χ²=3.45, p = 0.178) were found in the accuracy scores of the model trained on the three feature sets across 30 validation datasets. The mean validation accuracy scores of the model trained on acoustic, linguistic, and combined…
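The reported test result can be sanity-checked with a few lines of Python. This is a minimal sketch, not the authors' analysis: the abstract does not name the test, so the code assumes a chi-square reference distribution with 2 degrees of freedom (three feature sets minus one, as in a Friedman or Kruskal-Wallis test). The helper name `chi2_sf_df2` is hypothetical.

```python
import math


def chi2_sf_df2(stat: float) -> float:
    """Upper-tail probability of a chi-square variable with 2 degrees of freedom.

    With df = 2 the chi-square distribution is an exponential distribution
    with mean 2, so the survival function is exactly exp(-stat / 2).
    """
    if stat < 0:
        raise ValueError("chi-square statistic must be non-negative")
    return math.exp(-stat / 2.0)


# Assumption: three feature sets are compared, so df = 3 - 1 = 2.
reported_stat = 3.45  # chi-square value quoted in the abstract
p_value = chi2_sf_df2(reported_stat)
print(f"p = {p_value:.3f}")  # prints p = 0.178
```

Under that assumption the computed value agrees with the reported p = 0.178, which is above the conventional 0.05 threshold and consistent with the conclusion that the three feature sets do not differ significantly in accuracy.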
Taxonomy
Topics: Machine Learning in Healthcare · Mental Health via Writing · Dementia and Cognitive Impairment Research
