Simulating Students or Sycophantic Problem Solving? On Misconception Faithfulness of LLM Simulators
Heejin Do, Shashank Sonkar, Mrinmaya Sachan

TL;DR
This paper evaluates whether large language models (LLMs) can simulate student misconceptions during interactions and introduces a framework to measure and improve their misconception faithfulness.
Contribution
The paper presents a novel framework and metrics for assessing and enhancing LLM simulators' ability to maintain and update misconceptions interactively.
Findings
LLMs show near-zero misconception flip scores, indicating poor misconception faithfulness.
Models tend to behave as problem-solvers rather than belief-maintaining students.
Post-training methods improve misconception faithfulness, with reinforcement learning showing consistent gains.
Abstract
Large language models (LLMs) can fluently generate student-like responses, making them attractive as simulated students for training and evaluating AI tutors and human educators. Yet such simulators are typically evaluated by output similarity to real students, not by whether they behave like students with coherent misconceptions during interaction. We introduce a controlled framework for evaluating misconception faithfulness, whether a simulator maintains a misconception-driven belief state and updates selectively when feedback addresses the underlying misconception. Central to our framework is a misconception-contrastive feedback protocol that compares targeted feedback against two controls: misaligned feedback (targeting a different but plausible misconception) and generic feedback (only identifying answer is wrong). We propose Selective Flip Score (SFS), which quantifies how much…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
