# On the Suitability of Data Augmentation Techniques to Improve Parkinson’s Disease Detection with Speech Recordings

**Authors:** Cristian David Ríos-Urrego, Tulio Andrés Ruiz-Romero, David Puerta-Lotero, Daniel Escobar-Grisales, Juan Rafael Orozco-Arroyave

PMC · DOI: 10.3390/diagnostics16030498 · Diagnostics · 2026-02-06

## TL;DR

This paper explores how data augmentation improves Parkinson's disease detection from speech but finds limited generalization across datasets.

## Contribution

The study evaluates data augmentation techniques for PD detection and highlights their limited effectiveness in generalization.

## Key findings

- Data augmentation improves classification accuracy by up to 3% on the same dataset.
- Improvements do not consistently translate to better performance on independent datasets.
- Augmentation enhances model robustness within a dataset but not generalization.

## Abstract

Background: Parkinson’s disease (PD) is a neurodegenerative disorder that affects millions of people worldwide. Speech analysis has emerged as a non-invasive tool for automatic PD detection; however, the scarcity and homogeneity of available datasets often limit the generalization capability of machine learning models, motivating the use of data augmentation strategies to improve robustness. Methods: This study presents a data augmentation-based methodology for speech-based classification between PD patients and healthy control subjects. A deep learning model trained from scratch on Mel spectrograms is evaluated using augmentation techniques applied at both the waveform and time–frequency levels. Multiple training and model selection strategies are analyzed and model performance is assessed through internal validation as well as using an independent dataset Results: Experimental results show that carefully selected data augmentation techniques improve classification performance with respect to the non-augmented counterpart, achieving gains of up to 3% in accuracy. However, when evaluated on an independent dataset, these improvements do not consistently translate into better generalization. Conclusions: These findings demonstrate that, while data augmentation can effectively enhance model performance within a single dataset, this apparent robustness is not sufficient to guarantee generalization on independent speech corpora for PD detection.

## Linked entities

- **Diseases:** Parkinson’s disease (MONDO:0005180)

## Full-text entities

- **Diseases:** PD (MESH:D010300), neurodegenerative disorder (MESH:D019636)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12896770/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12896770/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/PMC12896770/full.md

---
Source: https://tomesphere.com/paper/PMC12896770