Auditing the Imputation Effect on Fairness of Predictive Analytics in Higher Education
Hadis Anahideh, Parian Haghighat, Nazanin Nezami, Denisa G`andara

TL;DR
This paper examines how different imputation techniques affect the fairness of predictive models for student success in higher education, highlighting the importance of addressing societal inequalities to reduce bias.
Contribution
It provides a comprehensive analysis of the impact of imputation methods on fairness and performance in predictive analytics for higher education, using real large-scale data.
Findings
Imputation can introduce bias if test data reflects historical societal disparities.
Equalizing societal disparities in data reduces bias introduced by imputation.
Prospective evaluation offers a less biased estimate of model fairness and performance.
Abstract
Colleges and universities use predictive analytics in a variety of ways to increase student success rates. Despite the potential for predictive analytics, two major barriers exist to their adoption in higher education: (a) the lack of democratization in deployment, and (b) the potential to exacerbate inequalities. Education researchers and policymakers encounter numerous challenges in deploying predictive modeling in practice. These challenges present in different steps of modeling including data preparation, model development, and evaluation. Nevertheless, each of these steps can introduce additional bias to the system if not appropriately performed. Most large-scale and nationally representative education data sets suffer from a significant number of incomplete responses from the research participants. While many education-related studies addressed the challenges of missing data,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics
