Developing a large scale population screening tool for the assessment of Parkinson's disease using telephone-quality voice
Siddharth Arora, Ladan Baghai-Ravary, Athanasios Tsanas

TL;DR
This study develops a large-scale, telephone-based voice analysis tool using machine learning to differentiate Parkinson's disease patients from healthy controls, aiming for accessible remote screening.
Contribution
It introduces a novel statistical framework with 307 dysphonia measures and demonstrates effective classification using Random Forests on diverse, large-scale telephone-quality voice data.
Findings
Mean sensitivity of 64.9% in PD detection
Mean specificity of 67.96% in healthy controls
Validated robustness across seven countries
Abstract
Recent studies have demonstrated that analysis of laboratory-quality voice recordings can be used to accurately differentiate people diagnosed with Parkinson's disease (PD) from healthy controls (HC). These findings could help facilitate the development of remote screening and monitoring tools for PD. In this study, we analyzed 2759 telephone-quality voice recordings from 1483 PD and 15321 recordings from 8300 HC participants. To account for variations in phonetic backgrounds, we acquired data from seven countries. We developed a statistical framework for analyzing voice, whereby we computed 307 dysphonia measures that quantify different properties of voice impairment, such as, breathiness, roughness, monopitch, hoarse voice quality, and exaggerated vocal tremor. We used feature selection algorithms to identify robust parsimonious feature subsets, which were used in combination with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
