Predicting lexical skills from oral reading with acoustic measures
Charvi Vitthal, Shreeharsha B S, Kamini Sabu, Preeti Rao

TL;DR
This paper introduces a simple, language-agnostic acoustic feature-based system to predict children's word-decoding skills from oral reading recordings, offering a resource-efficient alternative to traditional speech recognition methods.
Contribution
The study proposes a novel acoustic feature set for predicting oral reading skills, reducing reliance on computationally intensive ASR systems and enabling language-agnostic literacy assessment.
Findings
Acoustic features like pause statistics and syllable rate effectively identify reading deficits.
The proposed method achieves performance comparable to language-dependent ASR systems.
It provides a computationally simple, resource-efficient tool for literacy assessment.
Abstract
Literacy assessment is an important activity for education administrators across the globe. Typically achieved in a school setting by testing a child's oral reading, it is intensive in human resources. While automatic speech recognition (ASR) is a potential solution to the problem, it tends to be computationally expensive for hand-held devices apart from needing language and accent-specific speech for training. In this work, we propose a system to predict the word-decoding skills of a student based on simple acoustic features derived from the recording. We first identify a meaningful categorization of word-decoding skills by analyzing a manually transcribed data set of children's oral reading recordings. Next the automatic prediction of the category is attempted with the proposed acoustic features. Pause statistics, syllable rate and spectral and intensity dynamics are found to be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and dialogue systems · Phonetics and Phonology Research
