Uncovering Voice Misuse Using Symbolic Mismatch

Marzyeh Ghassemi; Zeeshan Syed; Daryush D. Mehta; Jarrad H. Van Stan,; Robert E. Hillman; and John V. Guttag

arXiv:1608.02301·cs.LG·August 9, 2016·2 cites

Uncovering Voice Misuse Using Symbolic Mismatch

Marzyeh Ghassemi, Zeeshan Syed, Daryush D. Mehta, Jarrad H. Van Stan,, Robert E. Hillman, and John V. Guttag

PDF

Open Access

TL;DR

This study introduces an unsupervised, large-scale data mining approach using accelerometer data to detect vocal misuse, revealing behavioral differences in voice disorder patients and aiding diagnosis and treatment evaluation.

Contribution

The paper presents the first large-scale analysis of vocal misuse using long-term accelerometer data and symbolic mismatch, offering an objective, data-driven method for voice disorder assessment.

Findings

01

Significant behavioral differences between patients and controls.

02

Detectable pre- and post-treatment differences.

03

Unsupervised symbolic mismatch effectively uncovers voice misuse patterns.

Abstract

Voice disorders affect an estimated 14 million working-aged Americans, and many more worldwide. We present the first large scale study of vocal misuse based on long-term ambulatory data collected by an accelerometer placed on the neck. We investigate an unsupervised data mining approach to uncovering latent information about voice misuse. We segment signals from over 253 days of data from 22 subjects into over a hundred million single glottal pulses (closures of the vocal folds), cluster segments into symbols, and use symbolic mismatch to uncover differences between patients and matched controls, and between patients pre- and post-treatment. Our results show significant behavioral differences between patients and controls, as well as between some pre- and post-treatment patients. Our proposed approach provides an objective basis for helping diagnose behavioral voice disorders, and is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Voice and Speech Disorders · Speech Recognition and Synthesis