Towards a Generalizable Speech Marker for Parkinson's Disease Diagnosis
Maksim Siniukov, Ellie Xing, Sanaz Attaripour Isfahani and, Mohammad Soleymani

TL;DR
This paper introduces a domain adaptation and self-supervised learning approach using HuBERT to create a generalizable, non-invasive speech-based diagnostic tool for Parkinson's Disease across multiple languages and datasets.
Contribution
It presents a novel method combining domain adaptation and self-supervised learning with HuBERT for cross-lingual PD detection, improving generalization and robustness.
Findings
Achieved over 92% specificity and 91% sensitivity across datasets.
Demonstrated effective generalization across English, Italian, and Spanish datasets.
Provided a cost-effective, non-invasive diagnostic alternative.
Abstract
Parkinson's Disease (PD) is a neurodegenerative disorder characterized by motor symptoms, including altered voice production in the early stages. Early diagnosis is crucial not only to improve PD patients' quality of life but also to enhance the efficacy of potential disease-modifying therapies during early neurodegeneration, a window often missed by current diagnostic tools. In this paper, we propose a more generalizable approach to PD recognition through domain adaptation and self-supervised learning. We demonstrate the generalization capabilities of the proposed approach across diverse datasets in different languages. Our approach leverages HuBERT, a large deep neural network originally trained for speech recognition and further trains it on unlabeled speech data from a population that is similar to the target group, i.e., the elderly, in a self-supervised manner. The model is then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVoice and Speech Disorders · Speech Recognition and Synthesis
