Pattern-based Knowledge Component Extraction from Student Code Using Representation Learning
Muntasir Hoq, Griffin Pitts, Tirth Bhatt, Aum Pandya, Andrew Lan, Peter Brusilovsky, Bita Akram

TL;DR
This paper introduces a pattern-based, data-driven method for discovering knowledge components in student programming solutions, improving student learning modeling and prediction accuracy.
Contribution
It proposes a novel framework combining code representation, latent space abstraction, and clustering to identify meaningful KCs from student code, enhancing knowledge tracing models.
Findings
Discovered KCs align with student struggles and mastery patterns.
Enhanced predictive performance in student modeling with pattern-based KCs.
Learning curve analysis confirms KCs' relevance to learning theory.
Abstract
Personalized instruction aims to provide learners with support that adapts to their individual knowledge and progress toward learning objectives. Discovering and tracing Knowledge Components (KCs) is an important step in building accurate models of student learning. However, KC discovery in computer science education is challenging due to the open-ended nature of programming, wide variability in student solutions, and intertwined use of programming structures in code. We address these challenges with a pattern-based KC discovery method that uses a data-driven approach to define KCs as recurring structural patterns in student code that reveal persistent patterns of struggle and mastery in students' solutions. We then evaluate the discovered KCs using expert evaluation and statistical student modeling to demonstrate their effectiveness in capturing student learning and struggles. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
