# An Explainable Ensemble and Deep Learning Framework for Accurate and Interpretable Parkinson’s Disease Detection from Voice Biomarkers

**Authors:** Suliman Aladhadh

PMC · DOI: 10.3390/diagnostics15222892 · 2025-11-14

## TL;DR

This paper introduces an AI framework that accurately detects Parkinson’s disease from voice biomarkers while providing clear explanations for its predictions.

## Contribution

The novel contribution is a unified and explainable AI framework combining ensemble and deep learning models for Parkinson’s detection with transparent interpretability.

## Key findings

- LightGBM and Random Forest achieved state-of-the-art accuracy (98.01%) and ROC-AUC (0.9914) for Parkinson’s detection.
- Nonlinear acoustic biomarkers like spread2, PPE, and RPDE were identified as the most influential predictors.
- Deep learning models like CNN and GAN showed competitive performance in capturing voice patterns related to Parkinson’s disease.

## Abstract

Background: Parkinson’s disease (PD) is a degenerative neurological disorder that greatly affects motor and speech functions; therefore, early diagnosis is vital for improving patients’ quality of life. This work introduces a unified and explainable AI framework for PD detection that integrates ensemble and deep learning models with transparent interpretability techniques. Methods: Acoustic features were extracted from the Parkinson’s Voice Disorder Dataset, and a broad suite of machine learning and deep learning models was evaluated, including traditional classifiers (Logistic Regression, Decision Tree, KNN, Linear Regression, SVM), ensemble methods (Random Forest, Gradient Boosting, XGBoost, LightGBM), and neural architectures (CNN, LSTM, GAN). Results: The ensemble methods—specifically LightGBM (LGBM) and Random Forest (RF)—achieved the best performance, reaching state-of-the-art accuracy (98.01%) and ROC-AUC (0.9914). Deep learning models like CNN and GAN produced competitive results, validating their ability to capture nonlinear and generative voice patterns. XAI analysis revealed that nonlinear acoustic biomarkers such as spread2, PPE, and RPDE are the most influential predictors, consistent with clinical evidence of dysphonia in PD. Conclusions: The proposed framework achieves a strong balance between predictive accuracy and interpretability, representing a clinically relevant, scalable, and non-invasive solution for early Parkinson’s detection.

## Linked entities

- **Diseases:** Parkinson’s disease (MONDO:0005180)

## Full-text entities

- **Diseases:** Parkinson's Voice Disorder (MESH:D014832), dysphonia (MESH:D055154), PD (MESH:D010300), degenerative neurological disorder (MESH:D019636)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12651050/full.md

---
Source: https://tomesphere.com/paper/PMC12651050