Using LLMs for Knowledge Component-level Correctness Labeling in Open-ended Coding Problems

Zhangqi Duan; Arnav Kankaria; Dhruv Kartik; Andrew Lan

arXiv:2602.17542·cs.CL·March 31, 2026

Using LLMs for Knowledge Component-level Correctness Labeling in Open-ended Coding Problems

Zhangqi Duan, Arnav Kankaria, Dhruv Kartik, Andrew Lan

PDF

TL;DR

This paper presents an automated framework using large language models to generate fine-grained knowledge component correctness labels from student code, improving learning curve modeling and prediction accuracy.

Contribution

It introduces a novel LLM-based method for labeling KCs at the code level, addressing the lack of such labels in open-ended programming tasks.

Findings

01

Enhanced learning curve fit aligning with cognitive theory

02

Improved predictive performance over baselines

03

High agreement between LLM and expert annotations

Abstract

Fine-grained skill representations, commonly referred to as knowledge components (KCs), are fundamental to many approaches in student modeling and learning analytics. However, KC-level correctness labels are rarely available in real-world datasets, especially for open-ended programming tasks where solutions typically involve multiple KCs simultaneously. Simply propagating problem-level correctness to all associated KCs obscures partial mastery and often leads to poorly fitted learning curves. To address this challenge, we propose an automated framework that leverages large language models (LLMs) to label KC-level correctness directly from student-written code. Our method assesses whether each KC is correctly applied and further introduces a temporal context-aware Code-KC mapping mechanism to better align KCs with individual student code. We evaluate the resulting KC-level correctness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.