A statistical framework for fair predictive algorithms
Kristian Lum, James Johndrow

TL;DR
This paper introduces a probabilistic framework to define and remove bias in predictive algorithms by eliminating protected variable information, demonstrating its effectiveness in creating fairer criminal justice predictions.
Contribution
It proposes a general method to eliminate bias by removing protected variable information, applicable to various data types, and shows its effectiveness in reducing racial disparities in predictions.
Findings
Removing protected variables reduces racial disparities.
The method maintains predictive accuracy.
Omitting race alone does not eliminate bias.
Abstract
Predictive modeling is increasingly being employed to assist human decision-makers. One purported advantage of replacing human judgment with computer models in high stakes settings-- such as sentencing, hiring, policing, college admissions, and parole decisions-- is the perceived "neutrality" of computers. It is argued that because computer models do not hold personal prejudice, the predictions they produce will be equally free from prejudice. There is growing recognition that employing algorithms does not remove the potential for bias, and can even amplify it, since training data were inevitably generated by a process that is itself biased. In this paper, we provide a probabilistic definition of algorithmic bias. We propose a method to remove bias from predictive models by removing all information regarding protected variables from the permitted training data. Unlike previous work in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Criminal Justice and Corrections Analysis · Ethics and Social Impacts of AI
