Conversational Education at Scale: A Multi-LLM Agent Workflow for Procedural Learning and Pedagogic Quality Assessment

Jiahuan Pei; Fanghua Ye; Xin Sun; Wentao Deng; Koen Hindriks; Junxiao Wang

arXiv:2507.05528·cs.AI·September 8, 2025

Conversational Education at Scale: A Multi-LLM Agent Workflow for Procedural Learning and Pedagogic Quality Assessment

Jiahuan Pei, Fanghua Ye, Xin Sun, Wentao Deng, Koen Hindriks, Junxiao Wang

PDF

Open Access

TL;DR

This paper introduces WikiHowAgent, a multi-LLM agent framework for scalable procedural learning and pedagogic quality assessment, supported by a large dataset of teacher-learner interactions across multiple domains.

Contribution

It presents a novel multi-agent workflow leveraging LLMs for interactive education and introduces a large, diverse dataset for evaluating pedagogic quality.

Findings

01

Effective in diverse educational setups

02

Provides insights into LLM capabilities across domains

03

Open-sourced dataset and implementation

Abstract

Large language models (LLMs) have advanced virtual educators and learners, bridging NLP with AI4Education. Existing work often lacks scalability and fails to leverage diverse, large-scale course content, with limited frameworks for assessing pedagogic quality. To this end, we propose WikiHowAgent, a multi-agent workflow leveraging LLMs to simulate interactive teaching-learning conversations. It integrates teacher and learner agents, an interaction manager, and an evaluator to facilitate procedural learning and assess pedagogic quality. We introduce a dataset of 114,296 teacher-learner conversations grounded in 14,287 tutorials across 17 domains and 727 topics. Our evaluation protocol combines computational and rubric-based metrics with human judgment alignment. Results demonstrate the workflow's effectiveness in diverse setups, offering insights into LLM capabilities across domains. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Innovative Teaching and Learning Methods · Topic Modeling