Parkinson's disease detection using spectrogram-based multi-model feature fusion networks
Wenna Chen, Rongfu Lv, Xiaowei Du, Xiangyu Chen, Hao Wang, Jincan Zhang, Ganqin Du

TL;DR
This paper introduces a non-invasive method for detecting Parkinson's disease using voice recordings and deep learning models that combine features from multiple neural networks.
Contribution
The novel contribution is a multi-model feature fusion approach using spectrograms and pre-trained CNNs to improve PD detection accuracy.
Findings
The fusion of MobileNetV3-Large and ShuffleNetV2 achieved 95.56% accuracy and an AUC of 0.99 in PD classification.
Feature fusion outperformed individual models in all evaluation metrics using 5-fold cross-validation.
The method effectively captures subtle speech patterns indicative of Parkinson's disease.
Abstract
Parkinson's disease (PD) is a common neurodegenerative disorder. Traditional diagnostic methods, relying on clinical assessment and imaging, are often invasive, costly, and require specialized personnel, posing barriers to early detection. As approximately 90% of PD patients develop vocal impairments, vocal analysis emerges as a promising non-invasive diagnostic tool. However, individual deep learning models are often limited by overfitting and poor generalizability. This study proposes a PD classification method using spectrogram feature fusion with pre-trained convolutional neural networks (CNNs). Voice recordings were obtained from 61 PD patients and 70 healthy controls (HC) at the First Affiliated Hospital of Henan University of Science and Technology. Preprocessing the raw speech signals yielded 2,476 spectrograms. Three pre-trained models, DenseNet121, MobileNetV3-Large, and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVoice and Speech Disorders · Parkinson's Disease Mechanisms and Treatments · Speech Recognition and Synthesis
