Knowledge Distillation in RNN-Attention Models for Early Prediction of   Student Performance

Sukrit Leelaluk; Cheng Tang; Valdemar \v{S}v\'abensk\'y; Atsushi; Shimada

arXiv:2412.14526·cs.LG·December 20, 2024

Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance

Sukrit Leelaluk, Cheng Tang, Valdemar \v{S}v\'abensk\'y, Atsushi, Shimada

PDF

1 Repo

TL;DR

This paper introduces an RNN-Attention-KD framework for early prediction of at-risk students in educational data mining, utilizing knowledge distillation to improve early detection accuracy throughout a course.

Contribution

It proposes a novel combination of RNNs, attention mechanisms, and knowledge distillation for early student performance prediction, addressing real-world dropout scenarios.

Findings

01

RNN-Attention-KD outperforms traditional models in recall and F1-score.

02

Early predictions achieved with high accuracy within the first few weeks.

03

Hint loss and context vector loss improve model performance.

Abstract

Educational data mining (EDM) is a part of applied computing that focuses on automatically analyzing data from learning contexts. Early prediction for identifying at-risk students is a crucial and widely researched topic in EDM research. It enables instructors to support at-risk students to stay on track, preventing student dropout or failure. Previous studies have predicted students' learning performance to identify at-risk students by using machine learning on data collected from e-learning platforms. However, most studies aimed to identify at-risk students utilizing the entire course data after the course finished. This does not correspond to the real-world scenario that at-risk students may drop out before the course ends. To address this problem, we introduce an RNN-Attention-KD (knowledge distillation) framework to predict at-risk students early throughout a course. It leverages…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

limu-research/2025-sac-rnn-attention-kd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSoftmax · Attention Is All You Need · Dropout · Hierarchical Information Threading · Focus