Augmenting Open-Vocabulary Dysarthric Speech Assessment with Human Perceptual Supervision
Kaimeng Jia, Minzhu Tu, Zengrui Jin, Siyin Wang, Chao Zhang

TL;DR
This paper explores using human perceptual ratings from speech synthesis assessments to improve automatic dysarthria speech evaluation, demonstrating that such supervision enhances model performance and cross-domain knowledge transfer.
Contribution
It introduces a novel approach of leveraging perceptual annotations from speech synthesis as supervision for dysarthric speech assessment, improving model accuracy.
Findings
Perceptual supervision improves dysarthria assessment accuracy.
Human-aligned ratings enhance cross-domain speech modeling.
Self-supervised models benefit from perceptual annotations.
Abstract
Dysarthria is a speech disorder characterized by impaired intelligibility and reduced communicative effectiveness. Automatic dysarthria assessment provides a scalable, cost-effective approach for supporting the diagnosis and treatment of neurological conditions such as Parkinson's disease, Alzheimer's disease, and stroke. This study investigates leveraging human perceptual annotations from speech synthesis assessment as reliable out-of-domain knowledge for dysarthric speech assessment. Experimental results suggest that such supervision can yield consistent and substantial performance improvements in self-supervised learning pre-trained models. These findings suggest that perceptual ratings aligned with human judgments from speech synthesis evaluations represent valuable resources for dysarthric speech modeling, enabling effective cross-domain knowledge transfer.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVoice and Speech Disorders · Speech Recognition and Synthesis · Phonocardiography and Auscultation Techniques
