TL;DR
This paper introduces an RNN-Attention-KD framework for early prediction of at-risk students in educational data mining, utilizing knowledge distillation to improve early detection accuracy throughout a course.
Contribution
It proposes a novel combination of RNNs, attention mechanisms, and knowledge distillation for early student performance prediction, addressing real-world dropout scenarios.
Findings
RNN-Attention-KD outperforms traditional models in recall and F1-score.
Early predictions achieved with high accuracy within the first few weeks.
Hint loss and context vector loss improve model performance.
Abstract
Educational data mining (EDM) is a part of applied computing that focuses on automatically analyzing data from learning contexts. Early prediction for identifying at-risk students is a crucial and widely researched topic in EDM research. It enables instructors to support at-risk students to stay on track, preventing student dropout or failure. Previous studies have predicted students' learning performance to identify at-risk students by using machine learning on data collected from e-learning platforms. However, most studies aimed to identify at-risk students utilizing the entire course data after the course finished. This does not correspond to the real-world scenario that at-risk students may drop out before the course ends. To address this problem, we introduce an RNN-Attention-KD (knowledge distillation) framework to predict at-risk students early throughout a course. It leverages…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax · Attention Is All You Need · Dropout · Hierarchical Information Threading · Focus
