AI-Driven Acoustic Voice Biomarker-Based Hierarchical Classification of Benign Laryngeal Voice Disorders from Sustained Vowels
Mohsen Annabestani, Samira Aghadoost, Anais Rameau, Olivier Elemento, Gloria Chia-Yi Chiang

TL;DR
This study presents a hierarchical machine learning framework that uses acoustic features from sustained vowels to accurately classify benign laryngeal voice disorders, aiming to improve early diagnosis and monitoring.
Contribution
The paper introduces a novel hierarchical classification system combining deep spectral features and interpretable biomarkers, optimized for clinical phonation analysis.
Findings
Outperforms flat classifiers and pre-trained models in accuracy.
Effectively stratifies voice disorders into clinically relevant categories.
Enhances transparency and clinical relevance of automated voice disorder diagnosis.
Abstract
Benign laryngeal voice disorders affect nearly one in five individuals and often manifest as dysphonia, while also serving as non-invasive indicators of broader physiological dysfunction. We introduce a clinically inspired hierarchical machine learning framework for automated classification of eight benign voice disorders alongside healthy controls, using acoustic features extracted from short, sustained vowel phonations. Experiments utilized 15,132 recordings from 1,261 speakers in the Saarbruecken Voice Database, covering vowels /a/, /i/, and /u/ at neutral, high, low, and gliding pitches. Mirroring clinical triage workflows, the framework operates in three sequential stages: Stage 1 performs binary screening of pathological versus non-pathological voices by integrating convolutional neural network-derived mel-spectrogram features with 21 interpretable acoustic biomarkers; Stage 2…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVoice and Speech Disorders · Phonocardiography and Auscultation Techniques · Respiratory and Cough-Related Research
