# Developing a large scale population screening tool for the assessment of   Parkinson's disease using telephone-quality voice

**Authors:** Siddharth Arora, Ladan Baghai-Ravary, Athanasios Tsanas

arXiv: 1905.00377 · 2019-05-22

## TL;DR

This study develops a large-scale, telephone-based voice analysis tool using machine learning to differentiate Parkinson's disease patients from healthy controls, aiming for accessible remote screening.

## Contribution

It introduces a novel statistical framework with 307 dysphonia measures and demonstrates effective classification using Random Forests on diverse, large-scale telephone-quality voice data.

## Key findings

- Mean sensitivity of 64.9% in PD detection
- Mean specificity of 67.96% in healthy controls
- Validated robustness across seven countries

## Abstract

Recent studies have demonstrated that analysis of laboratory-quality voice recordings can be used to accurately differentiate people diagnosed with Parkinson's disease (PD) from healthy controls (HC). These findings could help facilitate the development of remote screening and monitoring tools for PD. In this study, we analyzed 2759 telephone-quality voice recordings from 1483 PD and 15321 recordings from 8300 HC participants. To account for variations in phonetic backgrounds, we acquired data from seven countries. We developed a statistical framework for analyzing voice, whereby we computed 307 dysphonia measures that quantify different properties of voice impairment, such as, breathiness, roughness, monopitch, hoarse voice quality, and exaggerated vocal tremor. We used feature selection algorithms to identify robust parsimonious feature subsets, which were used in combination with a Random Forests (RF) classifier to accurately distinguish PD from HC. The best 10-fold cross-validation performance was obtained using Gram-Schmidt Orthogonalization (GSO) and RF, leading to mean sensitivity of 64.90% (standard deviation, SD 2.90%) and mean specificity of 67.96% (SD 2.90%). This large-scale study is a step forward towards assessing the development of a reliable, cost-effective and practical clinical decision support tool for screening the population at large for PD using telephone-quality voice.

---
Source: https://tomesphere.com/paper/1905.00377