Evolutionary Reinforcement Learning based AI tutor for Socratic Interdisciplinary Instruction

Mei Jiang; Haihai Shen; Zhuo Luo; Bingdong Li; Wenjing Hong; Ke Tang; Aimin Zhou

arXiv:2512.11930·cs.CY·December 16, 2025

Evolutionary Reinforcement Learning based AI tutor for Socratic Interdisciplinary Instruction

Mei Jiang, Haihai Shen, Zhuo Luo, Bingdong Li, Wenjing Hong, Ke Tang, Aimin Zhou

PDF

Open Access

TL;DR

This paper introduces ERL4SIIP, an evolutionary reinforcement learning framework designed to enhance AI tutors for Socratic interdisciplinary STEM education by modeling student cognition and addressing reward sparsity.

Contribution

It formalizes the Socratic Interdisciplinary Instructional Problem as a POMDP and develops a novel ERL framework combining a student simulator, hierarchical rewards, and hybrid optimization strategies.

Findings

01

Effective modeling of student cognitive states.

02

Improved policy diversity and exploration.

03

Enhanced long-term educational goal achievement.

Abstract

Cultivating higher-order cognitive abilities -- such as knowledge integration, critical thinking, and creativity -- in modern STEM education necessitates a pedagogical shift from passive knowledge transmission to active Socratic construction. Although Large Language Models (LLMs) hold promise for STEM Interdisciplinary education, current methodologies employing Prompt Engineering (PE), Supervised Fine-tuning (SFT), or standard Reinforcement Learning (RL) often fall short of supporting this paradigm. Existing methods are hindered by three fundamental challenges: the inability to dynamically model latent student cognitive states; severe reward sparsity and delay inherent in long-term educational goals; and a tendency toward policy collapse lacking strategic diversity due to reliance on behavioral cloning. Recognizing the unobservability and dynamic complexity of these interactions, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Evolutionary Algorithms and Applications · Reinforcement Learning in Robotics