Cross-lingual Dysarthria Severity Classification for English, Korean, and Tamil
Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

TL;DR
This study introduces a cross-lingual dysarthria severity classification method using language-independent and language-specific features, validated across English, Korean, and Tamil, outperforming mono-lingual approaches.
Contribution
The paper presents a novel cross-lingual classification approach that effectively combines shared and unique speech features for multiple languages, improving severity detection accuracy.
Findings
Achieved 67.14% F1 score, outperforming baseline methods.
Improved classification performance for all three languages.
Separating shared and language-specific features enhances accuracy.
Abstract
This paper proposes a cross-lingual classification method for English, Korean, and Tamil, which employs both language-independent features and language-unique features. First, we extract thirty-nine features from diverse speech dimensions such as voice quality, pronunciation, and prosody. Second, feature selections are applied to identify the optimal feature set for each language. A set of shared features and a set of distinctive features are distinguished by comparing the feature selection results of the three languages. Lastly, automatic severity classification is performed, utilizing the two feature sets. Notably, the proposed method removes different features by languages to prevent the negative effect of unique features for other languages. Accordingly, eXtreme Gradient Boosting (XGBoost) algorithm is employed for classification, due to its strength in imputing missing data. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVoice and Speech Disorders
MethodsFeature Selection
