Automated evaluation of children's speech fluency for low-resource languages
Bowen Zhang, Nur Afiqah Abdul Latiff, Justin Kan, Rong Tong, Donny Soh, Xiaoxiao Miao, Ian McLoughlin

TL;DR
This paper introduces an automated system for assessing children's speech fluency in low-resource languages using a combination of multilingual ASR, objective metrics, and GPT-based classification, outperforming existing methods.
Contribution
It presents a novel integrated approach combining multilingual ASR, objective metrics, and GPT-based classification specifically for low-resource languages.
Findings
Higher accuracy than Random Forest and XGBoost classifiers.
GPT-based classifier outperforms multimodal GPT in fluency scoring.
Effective assessment demonstrated on Tamil and Malay datasets.
Abstract
Assessment of children's speaking fluency in education is well researched for majority languages, but remains highly challenging for low resource languages. This paper proposes a system to automatically assess fluency by combining a fine-tuned multilingual ASR model, an objective metrics extraction stage, and a generative pre-trained transformer (GPT) network. The objective metrics include phonetic and word error rates, speech rate, and speech-pause duration ratio. These are interpreted by a GPT-based classifier guided by a small set of human-evaluated ground truth examples, to score fluency. We evaluate the proposed system on a dataset of children's speech in two low-resource languages, Tamil and Malay and compare the classification performance against Random Forest and XGBoost, as well as using ChatGPT-4o to predict fluency directly from speech input. Results demonstrate that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage Development and Disorders
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Linear Layer · Layer Normalization · Byte Pair Encoding · Residual Connection · Discriminative Fine-Tuning · Dense Connections · Linear Warmup With Cosine Annealing
