Multi-class Detection of Pathological Speech with Latent Features: How does it perform on unseen data?
Dominik Wagner, Ilja Baumann, Franziska Braun, Sebastian P. Bayerl,, Elmar N\"oth, Korbinian Riedhammer, Tobias Bocklet

TL;DR
This study explores multi-class classification of pathological speech using latent features from wav2vec 2.0, demonstrating robust performance on unseen data and various noise conditions.
Contribution
It introduces a multi-class approach leveraging latent features from wav2vec 2.0 for detecting multiple speech pathologies, extending beyond binary classification.
Findings
Achieves unweighted average F1-Scores between 74.1% and 97.0%.
Classifiers generalize well to unseen healthy speech data.
Robustness confirmed under noisy conditions with added room impulse responses.
Abstract
The detection of pathologies from speech features is usually defined as a binary classification task with one class representing a specific pathology and the other class representing healthy speech. In this work, we train neural networks, large margin classifiers, and tree boosting machines to distinguish between four pathologies: Parkinson's disease, laryngeal cancer, cleft lip and palate, and oral squamous cell carcinoma. We show that latent representations extracted at different layers of a pre-trained wav2vec 2.0 system can be effectively used to classify these types of pathological voices. We evaluate the robustness of our classifiers by adding room impulse responses to the test data and by applying them to unseen speech corpora. Our approach achieves unweighted average F1-Scores between 74.1% and 97.0%, depending on the model and the noise conditions used. The systems generalize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVoice and Speech Disorders · Head and Neck Cancer Studies · Dysphagia Assessment and Management
MethodsTest
