# Enhanced diagnostic interpretation of the MoCA using machine learning

**Authors:** Christian Gourdeau, Charles L. Gourdeau, Patrick J. Bernier, Robert Laforce

PMC · DOI: 10.3389/fnins.2026.1679649 · Frontiers in Neuroscience · 2026-02-20

## TL;DR

This paper shows that machine learning can improve the MoCA cognitive test's ability to detect dementia by analyzing detailed scores and patient data.

## Contribution

The study introduces interpretable machine learning models that enhance MoCA diagnostic accuracy and subtype classification beyond traditional cutoffs.

## Key findings

- XGBoost outperformed standard MoCA cutoffs with a Youden Index of 0.61 for cognitive impairment detection.
- Primary progressive aphasia was classified best (Youden ≈ 0.77), while vascular dementia remained challenging.
- Cognitive Quotient was a universal predictor, with disease-specific features like delayed recall for Alzheimer’s.

## Abstract

Artificial Intelligence (AI) is increasingly being integrated into clinical practice to optimize diagnosis in neurocognition. By capturing distinct cognitive signatures, this approach may offer a more precise alternative to the traditional interpretation of the Montreal Cognitive Assessment (MoCA) which often relies on a fixed cutoff score (26/30). We aimed to evaluate whether machine learning models, by integrating detailed MoCA subtest scores, demographic variables, and cognitive chart-derived metrics, can improve the detection of cognitive impairment and classification of dementia subtypes.

We analyzed 38,746 clinical observations (17,188 unique individuals) from the National Alzheimer’s Coordinating Center database. Five supervised learning algorithms, Extreme Gradient Boosting (XGBoost), Random Forest, Support Vector Machine (SVM), Logistic Regression, and k-Nearest Neighbors (KNN), were trained using detailed MoCA subtest scores, demographic variables, and cognitive chart-derived metrics as predictors. To ensure generalizability of results and prevent data leakage, we applied a rigorous nested Repeated Grouped Cross-Validation strategy. Decision thresholds were optimized via the Youden Index on independent calibration sets, and model interpretability was ensured through SHAP value analysis.

Machine learning models consistently outperformed conventional approach. For the global detection of cognitive impairment, XGBoost achieved the best performance (Youden Index 0.61 vs. 0.54 for the standard cutoff). Regarding subtype classification, models demonstrated variable discriminative capacity depending on clinical homogeneity: primary progressive aphasia was best classified (Youden ≈ 0.77), followed by Lewy body dementia and Alzheimer’s disease, while vascular dementia remained more challenging to isolate. Feature importance analysis highlighted the Cognitive Quotient as a robust universal predictor, while pinpointing disease-specific drivers such as delayed recall for Alzheimer’s disease and verbal fluency for primary progressive aphasia.

Our findings suggest interpretable machine learning enhances diagnostic utility of the MoCA, yielding superior accuracy compared to a fixed cutoff. By synthesizing individualized subtest profiles within a transparent framework, this approach offers a clinically actionable solution. It transforms the MoCA from a simple screening tool to a precision diagnostic aid, optimizing patient triage in the era of disease-modifying therapies.

## Linked entities

- **Diseases:** Alzheimer’s disease (MONDO:0004975), Lewy body dementia (MONDO:0007488), vascular dementia (MONDO:0004648), primary progressive aphasia (MONDO:0019806)

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, DOCK3 (dedicator of cytokinesis 3) [NCBI Gene 1795] {aka MOCA, NEDIDHA, PBP}
- **Diseases:** dysexecutive symptoms (MESH:D012816), VD (MESH:D015140), executive and frontal lobe related impairment (MESH:D001927), deficits in attention and executive functions (MESH:D001289), neurocognitive disorder (MESH:D019965), aphasia (MESH:D001037), fluency (MESH:D013064), amyloid accumulation (MESH:C000718787), dementia (MESH:D003704), dysfunction (MESH:D006331), visuospatial (MESH:D000377), Frontotemporal Dementia (MESH:D057180), memory impairment (MESH:D008569), Cognitive Impairment (MESH:D003072), PPA (MESH:D018888), neurodegeneration (MESH:D019636), MCI (MESH:D060825), AD (MESH:D000544), DLB (MESH:D020961), AI (MESH:C538142), stroke (MESH:D020521), deficits in repetition and (MESH:D009461), body (MESH:D001835)
- **Chemicals:** donanemab (-), lecanemab (MESH:C000612089)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12963294/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12963294/full.md

## References

47 references — full list in the complete paper: https://tomesphere.com/paper/PMC12963294/full.md

---
Source: https://tomesphere.com/paper/PMC12963294