Machine-learning applied to classify flow-induced sound parameters from simulated human voice
Florian Kraxberger, Andreas Wurzinger, Stefan Schoder

TL;DR
This study uses simulated human voice data and machine learning techniques to classify flow-induced sound parameters, aiming to better understand voice disorders and improve diagnostic features.
Contribution
It introduces a hybrid aeroacoustic simulation model and applies machine learning to classify voice disorder characteristics from simulated acoustic parameters.
Findings
Subglottal pressure classified with 91.7% accuracy.
CPP and GC type are key discriminators.
LDA visualization reveals important acoustic features.
Abstract
Disorders of voice production have severe effects on the quality of life of the affected individuals. A simulation approach is used to investigate the cause-effect chain in voice production showing typical characteristics of voice such as sub-glottal pressure and of functional voice disorders as glottal closure insufficiency and left-right asymmetry. Therewith, 24 different voice configurations are simulated in a parameter study using a previously published hybrid aeroacoustic simulation model. Based on these 24 simulation configurations, selected acoustic parameters (HNR, CPP, ...) at simulation evaluation points are correlated with these simulation configuration details to derive characteristic insight in the flow-induced sound generation of human phonation based on simulation results. Recently, several institutions studied experimental data, of flow and acoustic properties and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVoice and Speech Disorders · Speech Recognition and Synthesis · Phonetics and Phonology Research
