Evaluating the Explainers: Black-Box Explainable Machine Learning for Student Success Prediction in MOOCs
Vinitra Swamy, Bahar Radmehr, Natasa Krco, Mirko Marras, Tanja K\"aser

TL;DR
This study evaluates five explainability methods for neural network models predicting student success in MOOCs, revealing significant discrepancies among explanations and emphasizing the importance of explainer choice.
Contribution
It systematically compares multiple explainers on MOOC data, highlighting their inconsistencies and the critical impact of explainer selection on interpretation.
Findings
Explainability methods often disagree on feature importance.
Explainer choice significantly affects interpretation more than course differences.
Quantitative analysis confirms variability across explainers and courses.
Abstract
Neural networks are ubiquitous in applied machine learning for education. Their pervasive success in predictive performance comes alongside a severe weakness, the lack of explainability of their decisions, especially relevant in human-centric fields. We implement five state-of-the-art methodologies for explaining black-box machine learning models (LIME, PermutationSHAP, KernelSHAP, DiCE, CEM) and examine the strengths of each approach on the downstream task of student performance prediction for five massive open online courses. Our experiments demonstrate that the families of explainers do not agree with each other on feature importance for the same Bidirectional LSTM models with the same representative set of students. We use Principal Component Analysis, Jensen-Shannon distance, and Spearman's rank-order correlation to quantitatively cross-examine explanations across methods and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Online Learning and Analytics · Machine Learning and Data Classification
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory
