Large Language Model-Driven Dynamic Assessment of Grammatical Accuracy in English Language Learner Writing

Timur Jaganov; John Blake; Juli\'an Villegas; Nicholas Carr

arXiv:2505.00931·cs.CL·September 8, 2025

Large Language Model-Driven Dynamic Assessment of Grammatical Accuracy in English Language Learner Writing

Timur Jaganov, John Blake, Juli\'an Villegas, Nicholas Carr

PDF

TL;DR

This paper explores using large language models to enhance dynamic assessment of grammatical accuracy in English learners, demonstrating that LLMs like GPT-4o can provide scalable, accurate, and high-quality feedback in language learning.

Contribution

The study develops DynaWrite, a modular system supporting multiple LLMs for dynamic assessment, and identifies GPT-4o as the most effective model for scalable grammatical feedback.

Findings

01

GPT-4o and neural chat effectively identify grammatical errors.

02

GPT-4o provides clearer, more explicit hints than neural chat.

03

GPT-4o demonstrates sufficient speed and stability for real-time assessment.

Abstract

This study investigates the potential for Large Language Models (LLMs) to scale-up Dynamic Assessment (DA). To facilitate such an investigation, we first developed DynaWrite-a modular, microservices-based grammatical tutoring application which supports multiple LLMs to generate dynamic feedback to learners of English. Initial testing of 21 LLMs, revealed GPT-4o and neural chat to have the most potential to scale-up DA in the language learning classroom. Further testing of these two candidates found both models performed similarly in their ability to accurately identify grammatical errors in user sentences. However, GPT-4o consistently outperformed neural chat in the quality of its DA by generating clear, consistent, and progressively explicit hints. Real-time responsiveness and system stability were also confirmed through detailed performance testing, with GPT-4o exhibiting sufficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings