# Applying Multiple Machine Learning Models to Classify Mild Cognitive Impairment from Speech in Community-Dwelling Older Adults

**Authors:** Renqing Zhao, Zhiyuan Zhu, Zihui Huang

PMC · DOI: 10.3390/jintelligence14020017 · 2026-01-26

## TL;DR

This study uses speech analysis and machine learning to accurately detect early signs of cognitive impairment in older adults.

## Contribution

The novel use of optimized speech features with multiple machine learning models improves MCI detection accuracy.

## Key findings

- The XGBoost model achieved the highest accuracy (0.92) in distinguishing MCI from healthy controls.
- Speech feature extraction using Librosa and SFS optimization significantly improved model performance.
- Machine learning models based on speech data can reliably identify early cognitive impairment.

## Abstract

This study aims to develop effective screening tools for cognitive impairment by integrating optimised speech classification features with various machine learning models. A total of 65 patients diagnosed with early-stage Mild Cognitive Impairment (MCI) and 55 healthy controls (HCs) were included. Audio data were collected through a picture description task and processed using the Python-based Librosa library for speech feature extraction. Three machine learning models were constructed: the Random Forest (RF) and Support Vector Machine (SVM) models utilised speech classification features optimised via the Sequential Forward Selection (SFS) algorithm, while the Extreme Gradient Boosting (XGBoost) model was trained on preprocessed speech data. After parameter tuning, the Librosa library successfully extracted 41 speech classification features from all participants. The application of the SFS optimisation strategy and the use of preprocessed data significantly improved identification accuracy. The SVM model achieved an accuracy of 0.825 (AUC: 0.91), the RF model reached 0.88 (AUC: 0.86), and the XGBoost model attained 0.92 (AUC: 0.91). These results suggest that speech-based machine learning models markedly improve the accuracy of distinguishing MCI patients from healthy older adults, providing reliable support for early cognitive deficit identification.

## Full-text entities

- **Diseases:** language deficits (MESH:D007806), dementia (MESH:D003704), neurological disorders (MESH:D009461), hearing problems (MESH:D034381), Cognitive Impairment (MESH:D003072), injury to (MESH:D014947), neurodegenerative disease (MESH:D019636), PD (MESH:D010300), neuropsychiatric illness (MESH:C000631768), substance abuse (MESH:D019966), MCI (MESH:D060825), AD (MESH:D000544), Aphasia (MESH:D001037)
- **Chemicals:** Fluorodeoxyglucose (MESH:D019788)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12941918/full.md

---
Source: https://tomesphere.com/paper/PMC12941918