Scalable Early Childhood Reading Performance Prediction
Zhongkai Shangguan, Zanming Huang, Eshed Ohn-Bar, Ola, Ozernov-Palchik, Derek Kosty, Michael Stoolmiller, Hank Fien

TL;DR
This paper introduces a large-scale dataset for early childhood reading performance prediction and demonstrates a self-supervised learning approach with MLPs that outperforms baselines, aiding early educational interventions.
Contribution
The work provides the first large-scale, publicly available dataset for early reading prediction and shows the effectiveness of a self-supervised pre-training strategy with MLPs in this context.
Findings
Self-supervised MLP pre-training outperforms baseline models.
The dataset includes data from 44 schools, 6,916 students, and 172 teachers.
Models generalize well across diverse educational settings.
Abstract
Models for student reading performance can empower educators and institutions to proactively identify at-risk students, thereby enabling early and tailored instructional interventions. However, there are no suitable publicly available educational datasets for modeling and predicting future reading performance. In this work, we introduce the Enhanced Core Reading Instruction ECRI dataset, a novel large-scale longitudinal tabular dataset collected across 44 schools with 6,916 students and 172 teachers. We leverage the dataset to empirically evaluate the ability of state-of-the-art machine learning models to recognize early childhood educational patterns in multivariate and partial measurements. Specifically, we demonstrate a simple self-supervised strategy in which a Multi-Layer Perception (MLP) network is pre-trained over masked inputs to outperform several strong baselines while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning
