Towards Fair and Privacy-Aware Transfer Learning for Educational Predictive Modeling: A Case Study on Retention Prediction in Community Colleges
Chengyuan Yao, Carmen Cortez, Renzhe Yu

TL;DR
This study explores transfer learning for retention prediction in community colleges, demonstrating how privacy constraints and contextual information influence model fairness and performance across institutions.
Contribution
It introduces strategies for privacy-aware transfer learning in educational settings, highlighting methods to improve fairness and accuracy without compromising privacy.
Findings
Transfer learning can improve retention prediction fairness and performance.
Contextual information helps forecast and mitigate model performance drops.
Sequential training based on demographic similarities enhances fairness.
Abstract
Predictive analytics is widely used in learning analytics, but many resource-constrained institutions lack the capacity to develop their own models or rely on proprietary ones trained in different contexts with little transparency. Transfer learning holds promise for expanding equitable access to predictive analytics but remains underexplored due to legal and technical constraints. This paper examines transfer learning strategies for retention prediction at U.S. two-year community colleges. We envision a scenario where community colleges collaborate with each other and four-year universities to develop retention prediction models under privacy constraints and evaluate risks and improvement strategies of cross-institutional model transfer. Using administrative records from 4 research universities and 23 community colleges covering over 800,000 students across 7 cohorts, we identify…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics · Privacy-Preserving Technologies in Data · Intelligent Tutoring Systems and Adaptive Learning
