The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech
Yu Qiao, Wei Zhou, Elma Kerz, Ralf Schl\"uter

TL;DR
This study investigates how the performance of state-of-the-art ASR systems influences the automatic analysis of linguistic complexity in spontaneous L2 speech, highlighting the importance of ASR accuracy for reliable linguistic assessment.
Contribution
It provides an empirical evaluation of the impact of ASR quality on various linguistic complexity measures in spontaneous L2 speech analysis.
Findings
ASR performance significantly affects the accuracy of complexity measures.
Certain complexity measures are more sensitive to ASR errors.
Task type influences the relationship between ASR quality and measure accuracy.
Abstract
In recent years, automated approaches to assessing linguistic complexity in second language (L2) writing have made significant progress in gauging learner performance, predicting human ratings of the quality of learner productions, and benchmarking L2 development. In contrast, there is comparatively little work in the area of speaking, particularly with respect to fully automated approaches to assessing L2 spontaneous speech. While the importance of a well-performing ASR system is widely recognized, little research has been conducted to investigate the impact of its performance on subsequent automatic text analysis. In this paper, we focus on this issue and examine the impact of using a state-of-the-art ASR system for subsequent automatic analysis of linguistic complexity in spontaneously produced L2 speech. A set of 30 selected measures were considered, falling into four categories:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text Readability and Simplification · Topic Modeling
