Simulating Novice Students Using Machine Unlearning and Relearning in Large Language Models
Jiajia Song, Zhihan Guo, Jionghao Lin

TL;DR
This paper introduces a novel method using machine unlearning to create stable novice-level AI students in large language models for learning-by-teaching simulations, addressing the issue of models drifting to expert-level responses.
Contribution
It proposes a knowledge-level simulation approach with machine unlearning, enabling LLMs to maintain and recover novice knowledge during teaching interactions.
Findings
Unlearning yields more novice-like responses than prompt-only methods.
Agents can recover some unlearned knowledge through structured learning.
Dialogue analysis reveals trajectories of conceptual change and teaching strategies.
Abstract
Student simulation can support learning-by-teaching pedagogy where human students (as tutors) teach AI-simulated novice students (as tutees). Recent research often relies on prompt engineering with large language models (LLMs) to simulate novice student behaviour, but it is difficult to keep the AI-simulated student at a stable novice knowledge level. A key reason is that many LLMs are trained to be broadly capable, so even when prompted to "act like a novice," the LLMs can still produce expert-level explanations during the learning-by-teaching interaction process. As a result, the AI-simulated student may drift beyond the intended knowledge level, reducing the credibility of the simulation for studying learning-by-teaching processes. Thus, we propose a knowledge-level simulation approach based on machine unlearning. We investigate this approach using a dataset of multiple-choice…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
