Misconception Acquisition Dynamics in Large Language Models
Naiming Liu, Xinghe Chen, Richard Baraniuk, Mrinmaya Sachan, Shashank Sonkar

TL;DR
This paper investigates how large language models acquire misconceptions in educational contexts, revealing different dynamics for student and tutor models and emphasizing the importance of intermediate reasoning supervision.
Contribution
It introduces two misconception-aware models and MalAlgoLib for generating problem traces, providing insights into misconception learning dynamics and training strategies.
Findings
Student models overapply misconceptions across problems without correct boundaries.
Tutor models can learn multiple misconceptions without losing correct reasoning.
Intermediate reasoning steps are crucial for misconception acquisition, requiring supervision beyond final answers.
Abstract
Effective educational AI depends on modeling student misconceptions. Such models enable realistic learner simulation and diagnostic, adaptive tutoring. However, instruction-tuning large language models on student responses containing misconception errors can degrade reasoning abilities, creating a tension between faithful misconception modeling and preserving correct reasoning in other contexts. To support both learner simulation and tutoring, we study two misconception-aware models: the Novice Student Misconception Model, trained to acquire a single misconception for simulating an individual student, and the Expert Tutor Misconception Model, trained on multiple misconceptions to capture the error patterns a tutor encounters across students. To study the misconception acquisition dynamics of both models, we develop MalAlgoLib, a library that generates algebra problems with correct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
