Speaker-Independent Dysarthria Severity Classification using   Self-Supervised Transformers and Multi-Task Learning

Lauren Stumpf; Balasundaram Kadirvelu; Sigourney Waibel; A.; Aldo Faisal

arXiv:2403.00854·q-bio.NC·March 5, 2024·1 cites

Speaker-Independent Dysarthria Severity Classification using Self-Supervised Transformers and Multi-Task Learning

Lauren Stumpf, Balasundaram Kadirvelu, Sigourney Waibel, A., Aldo Faisal

PDF

Open Access

TL;DR

This paper introduces a transformer-based, speaker-independent framework called SALR for automatic dysarthria severity classification from raw speech, outperforming traditional methods and establishing new benchmarks in clinical speech assessment.

Contribution

The study presents a novel SALR transformer framework with multi-task and contrastive learning for speaker-independent dysarthria severity classification, improving accuracy and robustness over prior approaches.

Findings

01

Achieved 70.48% accuracy and 59.23% F1 score on the Universal Access Speech dataset.

02

Exceeded previous SVM benchmark by 16.58%.

03

Visualized latent space to demonstrate reduced speaker-specific cues.

Abstract

Dysarthria, a condition resulting from impaired control of the speech muscles due to neurological disorders, significantly impacts the communication and quality of life of patients. The condition's complexity, human scoring and varied presentations make its assessment and management challenging. This study presents a transformer-based framework for automatically assessing dysarthria severity from raw speech data. It can offer an objective, repeatable, accessible, standardised and cost-effective and compared to traditional methods requiring human expert assessors. We develop a transformer framework, called Speaker-Agnostic Latent Regularisation (SALR), incorporating a multi-task learning objective and contrastive learning for speaker-independent multi-class dysarthria severity classification. The multi-task framework is designed to reduce reliance on speaker-specific characteristics and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVoice and Speech Disorders · Speech Recognition and Synthesis

MethodsContrastive Learning