Interpreting Latent Student Knowledge Representations in Programming Assignments
Nigel Fernandez, Andrew Lan

TL;DR
This paper introduces InfoOIRT, a model that interprets latent student knowledge in programming assignments by generating code and learning disentangled, interpretable representations of student skills and code features.
Contribution
The paper presents a novel information regularized model that enhances interpretability of student knowledge representations while accurately generating student code.
Findings
InfoOIRT effectively generates student code.
It produces interpretable, disentangled knowledge representations.
The model performs well on real-world programming data.
Abstract
Recent advances in artificial intelligence for education leverage generative large language models, including using them to predict open-ended student responses rather than their correctness only. However, the black-box nature of these models limits the interpretability of the learned student knowledge representations. In this paper, we conduct a first exploration into interpreting latent student knowledge representations by presenting InfoOIRT, an Information regularized Open-ended Item Response Theory model, which encourages the latent student knowledge states to be interpretable while being able to generate student-written code for open-ended programming questions. InfoOIRT maximizes the mutual information between a fixed subset of latent knowledge states enforced with simple prior distributions and generated student code, which encourages the model to learn disentangled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTeaching and Learning Programming · Online Learning and Analytics · Innovative Teaching and Learning Methods
