Predicting lexical skills from oral reading with acoustic measures

Charvi Vitthal; Shreeharsha B S; Kamini Sabu; Preeti Rao

arXiv:2112.00635·eess.AS·December 2, 2021

Predicting lexical skills from oral reading with acoustic measures

Charvi Vitthal, Shreeharsha B S, Kamini Sabu, Preeti Rao

PDF

Open Access

TL;DR

This paper introduces a simple, language-agnostic acoustic feature-based system to predict children's word-decoding skills from oral reading recordings, offering a resource-efficient alternative to traditional speech recognition methods.

Contribution

The study proposes a novel acoustic feature set for predicting oral reading skills, reducing reliance on computationally intensive ASR systems and enabling language-agnostic literacy assessment.

Findings

01

Acoustic features like pause statistics and syllable rate effectively identify reading deficits.

02

The proposed method achieves performance comparable to language-dependent ASR systems.

03

It provides a computationally simple, resource-efficient tool for literacy assessment.

Abstract

Literacy assessment is an important activity for education administrators across the globe. Typically achieved in a school setting by testing a child's oral reading, it is intensive in human resources. While automatic speech recognition (ASR) is a potential solution to the problem, it tends to be computationally expensive for hand-held devices apart from needing language and accent-specific speech for training. In this work, we propose a system to predict the word-decoding skills of a student based on simple acoustic features derived from the recording. We first identify a meaningful categorization of word-decoding skills by analyzing a manually transcribed data set of children's oral reading recordings. Next the automatic prediction of the category is attempted with the proposed acoustic features. Pause statistics, syllable rate and spectral and intensity dynamics are found to be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and dialogue systems · Phonetics and Phonology Research