Cross-Course Generalizability of SRL-Aligned Predictive Models Using Digital Learning Traces
Jakob Schwerter, Loreen Sabel, Judith Bose, Matthew L. Bernacki, Di Xu, Marko Schmellenkamp, Thomas Zeume, and Philipp Doebler

TL;DR
This study evaluates the generalizability of SRL-aligned predictive models using digital learning traces across different computer science courses and universities, highlighting the importance of context in early at-risk student identification.
Contribution
It demonstrates that while early prediction of at-risk students is possible with digital traces, model robustness across different educational settings varies significantly.
Findings
Random Forest achieved highest in-sample accuracy.
Elastic Net generalized more robustly across different contexts.
Model calibration declined when applied across institutions with different at-risk rates.
Abstract
STEM dropout rates remain high at universities, particularly in computer science programs with theory-intensive courses. Digital learning environments now capture rich behavioral data that could help identify struggling students early, yet the generalizability of data-driven prediction models across courses and institutions remains uncertain. Guided by self-regulated learning (SRL) theory, this study analyzed multimodal digital-trace data from three undergraduate theoretical computer science courses (N1 = 137, N2 = 104, N3 = 148) at two universities. Weekly SRL-aligned digital-trace indicators were modeled using Elastic Net, Random Forest, and XGBoost to evaluate predictive performance over time and across settings, and model calibration both within and across courses. Early prediction of at-risk students was feasible, with SRL-related behaviors such as time management, effort…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
