NLP-Informed Dynamic Cognitive Diagnosis Modelling
Yawen Ma, Sahoko Ishida, Kate Cain, Gabriel Wallin

TL;DR
This paper introduces a Bayesian framework that integrates natural language processing of question texts into dynamic cognitive diagnosis models to improve skill estimation in digital learning environments.
Contribution
It presents a novel method combining NLP-derived semantic information with Bayesian dynamic CDMs to enhance Q-matrix estimation and skill modeling.
Findings
Text-derived priors improve Q-matrix recovery.
The framework enhances model parameters in data-limited scenarios.
Application to digital reading data shows better skill development modeling.
Abstract
Digital learning platforms are increasingly used to support reading development while generating rich log files and item-level textual content. Using these data, this study proposes a dynamic cognitive diagnostic modelling (CDM) framework that incorporates text-derived semantic information to inform the estimation of the Q-matrix. We construct item-level semantic representations of question text and response options, and use these representations to define an informative prior on the Q-matrix. This approach treats text-derived signals as proxies for item complexity and cognitive demands, guiding the item-skill mapping in a data-driven manner. The proposed framework jointly estimates latent skill mastery profiles, item parameters, and transition dynamics over time within a Bayesian framework. We apply the model to data from Boost Reading, a digital reading supplement, focusing on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
