Controllable Spoken Dialogue Generation: An LLM-Driven Grading System for K-12 Non-Native English Learners

Haidong Yuan; Haokun Zhao; Wanshi Xu; Songjun Cao; Qingyu Zhou; Long Ma; Hongjie Fan

arXiv:2604.22542·cs.CL·April 27, 2026

Controllable Spoken Dialogue Generation: An LLM-Driven Grading System for K-12 Non-Native English Learners

Haidong Yuan, Haokun Zhao, Wanshi Xu, Songjun Cao, Qingyu Zhou, Long Ma, Hongjie Fan

PDF

TL;DR

This paper presents a proficiency-aligned framework for controllable spoken dialogue generation tailored to K-12 non-native English learners, utilizing LLMs with a new grading system and the DDPO algorithm.

Contribution

It introduces a novel framework with a four-tier lexical control system and the DDPO algorithm to enhance dialogue diversity, quality, and pedagogical value for language learners.

Findings

01

Achieved low out-of-vocabulary rates in generated dialogues.

02

Enhanced dialogue diversity and naturalness.

03

Framework adaptable to various educational standards.

Abstract

Large language models (LLMs) often fail to meet the pedagogical needs of K-12 English learners in non-native contexts due to a proficiency mismatch. To address this widespread challenge, we introduce a proficiency-aligned framework that adapts LLM outputs to learner abilities, using China's national curriculum (CSE) as a representative case. Our framework enables precise control over lexical complexity through a four-tier grading system, supported by a comprehensive suite of new resources: graded vocabulary lists and a multi-turn dialogue corpus. Our core technical contribution is the \textbf{DDPO} algorithm,Diversity Driven Policy Optimization, a multi-turn GRPO-based approach designed to preserve dialogue diversity while holistically optimizing dialogue quality. This method significantly outperforms conventional approaches, achieving low out-of-vocabulary rates and high diversity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.