KASER: Knowledge-Aligned Student Error Simulator for Open-Ended Coding Tasks

Zhangqi Duan; Nigel Fernandez; Andrew Lan

arXiv:2601.06633·cs.LG·May 19, 2026

KASER: Knowledge-Aligned Student Error Simulator for Open-Ended Coding Tasks

Zhangqi Duan, Nigel Fernandez, Andrew Lan

PDF

TL;DR

KASER is a novel reinforcement learning-based method that aligns student error simulation with knowledge, improving diversity and accuracy in predicting student responses for open-ended coding tasks.

Contribution

The paper introduces KASER, a new approach that enhances student error simulation by aligning errors with knowledge using a hybrid reward in reinforcement learning.

Findings

01

Outperforms baselines in code and error prediction at the student-problem level.

02

Achieves better error coverage and code diversity at the problem level.

03

Effectively models diverse student responses in real-world datasets.

Abstract

Open-ended tasks, such as coding problems that are common in computer science education, provide detailed insights into student knowledge. However, training large language models (LLMs) to simulate and predict possible student errors in their responses to these problems can be challenging: they often suffer from mode collapse and fail to fully capture the diversity in syntax, style, and solution approach in student responses. In this work, we present KASER (Knowledge-Aligned Student Error Simulator), a novel approach that aligns errors with student knowledge. We propose a training method based on reinforcement learning using a hybrid reward that reflects three aspects of student code prediction: i) code similarity to the ground-truth, ii) error matching, and iii) code prediction diversity. On two real-world datasets, we perform two levels of evaluation and show that: At the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTeaching and Learning Programming · Software Engineering Research · Online Learning and Analytics