Delving Deeper into MOOC Student Dropout Prediction
Jacob Whitehill, Kiran Mohan, Daniel Seaton, Yigal Rosen, and Dustin, Tingley

TL;DR
This study evaluates how different training and testing regimes affect MOOC dropout prediction accuracy, revealing that realistic evaluation methods are crucial and that deep neural networks outperform logistic regression.
Contribution
It systematically compares training paradigms for MOOC dropout prediction and introduces deep neural networks, demonstrating their superior performance over traditional logistic regression.
Findings
Post-hoc training overestimates accuracy by several percentage points.
Proxy label-based classifiers are competitive with post-hoc training.
Neural networks with up to 5 hidden layers significantly improve accuracy.
Abstract
In order to obtain reliable accuracy estimates for automatic MOOC dropout predictors, it is important to train and test them in a manner consistent with how they will be used in practice. Yet most prior research on MOOC dropout prediction has measured test accuracy on the same course used for training the classifier, which can lead to overly optimistic accuracy estimates. In order to understand better how accuracy is affected by the training+testing regime, we compared the accuracy of a standard dropout prediction architecture (clickstream features + logistic regression) across 4 different training paradigms. Results suggest that (1) training and testing on the same course ("post-hoc") can overestimate accuracy by several percentage points; (2) dropout classifiers trained on proxy labels based on students' persistence are surprisingly competitive with post-hoc training (87.33% versus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics · Advancements in Semiconductor Devices and Circuit Design
MethodsDropout
