Evolutionary Reinforcement Learning based AI tutor for Socratic Interdisciplinary Instruction
Mei Jiang, Haihai Shen, Zhuo Luo, Bingdong Li, Wenjing Hong, Ke Tang, Aimin Zhou

TL;DR
This paper introduces ERL4SIIP, an evolutionary reinforcement learning framework designed to enhance AI tutors for Socratic interdisciplinary STEM education by modeling student cognition and addressing reward sparsity.
Contribution
It formalizes the Socratic Interdisciplinary Instructional Problem as a POMDP and develops a novel ERL framework combining a student simulator, hierarchical rewards, and hybrid optimization strategies.
Findings
Effective modeling of student cognitive states.
Improved policy diversity and exploration.
Enhanced long-term educational goal achievement.
Abstract
Cultivating higher-order cognitive abilities -- such as knowledge integration, critical thinking, and creativity -- in modern STEM education necessitates a pedagogical shift from passive knowledge transmission to active Socratic construction. Although Large Language Models (LLMs) hold promise for STEM Interdisciplinary education, current methodologies employing Prompt Engineering (PE), Supervised Fine-tuning (SFT), or standard Reinforcement Learning (RL) often fall short of supporting this paradigm. Existing methods are hindered by three fundamental challenges: the inability to dynamically model latent student cognitive states; severe reward sparsity and delay inherent in long-term educational goals; and a tendency toward policy collapse lacking strategic diversity due to reliance on behavioral cloning. Recognizing the unobservability and dynamic complexity of these interactions, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Evolutionary Algorithms and Applications · Reinforcement Learning in Robotics
