Role of Intonation in Scoring Spoken English
Amber Nigam, Arpan Saxena, Ishan Sodhi

TL;DR
This paper introduces an intonation-based feature, SimIntonation, for automated scoring of nonnative English speech, demonstrating its high predictive power in controlled environments and analyzing pause types for granular evaluation.
Contribution
It presents a novel intonation similarity feature, SimIntonation, and a detailed pause categorization method to improve automated spoken English scoring systems.
Findings
SimIntonation is highly predictive of speech quality.
Macro features like accuracy and intonation significantly influence scoring.
Pause-topography features contribute to granular speech evaluation.
Abstract
In this paper, we have introduced and evaluated intonation based feature for scoring the English speech of nonnative English speakers in Indian context. For this, we created an automated spoken English scoring engine to learn from the manual evaluation of spoken English. This involved using an existing Automatic Speech Recognition (ASR) engine to convert the speech to text. Thereafter, macro features like accuracy, fluency and prosodic features were used to build a scoring model. In the process, we introduced SimIntonation, short for similarity between spoken intonation pattern and "ideal" i.e. training intonation pattern. Our results show that it is a highly predictive feature under controlled environment. We also categorized interword pauses into 4 distinct types for a granular evaluation of pauses and their impact on speech evaluation. Moreover, we took steps to moderate test…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Phonetics and Phonology Research · Natural Language Processing Techniques
