Natural Language-based Assessment of L2 Oral Proficiency using LLMs

Stefano Bann\`o; Rao Ma; Mengjie Qian; Siyuan Tang; Kate Knill; Mark Gales

arXiv:2507.10200·eess.AS·July 15, 2025

Natural Language-based Assessment of L2 Oral Proficiency using LLMs

Stefano Bann\`o, Rao Ma, Mengjie Qian, Siyuan Tang, Kate Knill, Mark Gales

PDF

TL;DR

This paper investigates using large language models with natural language descriptors to assess second language oral proficiency, demonstrating competitive performance and enhanced interpretability in a zero-shot setting.

Contribution

It introduces a natural language-based assessment approach using LLMs with can-do descriptors, showing its effectiveness and generalizability compared to specialized models.

Findings

01

Achieves competitive performance with LLMs using textual descriptors.

02

Outperforms BERT-based models trained specifically for assessment.

03

Effective in mismatched task settings and across languages.

Abstract

Natural language-based assessment (NLA) is an approach to second language assessment that uses instructions - expressed in the form of can-do descriptors - originally intended for human examiners, aiming to determine whether large language models (LLMs) can interpret and apply them in ways comparable to human assessment. In this work, we explore the use of such descriptors with an open-source LLM, Qwen 2.5 72B, to assess responses from the publicly available S&I Corpus in a zero-shot setting. Our results show that this approach - relying solely on textual information - achieves competitive performance: while it does not outperform state-of-the-art speech LLMs fine-tuned for the task, it surpasses a BERT-based model trained specifically for this purpose. NLA proves particularly effective in mismatched task settings, is generalisable to other data types and languages, and offers greater…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.