A statistical framework for fair predictive algorithms

Kristian Lum; James Johndrow

arXiv:1610.08077·stat.ML·October 27, 2016·57 cites

A statistical framework for fair predictive algorithms

Kristian Lum, James Johndrow

PDF

Open Access

TL;DR

This paper introduces a probabilistic framework to define and remove bias in predictive algorithms by eliminating protected variable information, demonstrating its effectiveness in creating fairer criminal justice predictions.

Contribution

It proposes a general method to eliminate bias by removing protected variable information, applicable to various data types, and shows its effectiveness in reducing racial disparities in predictions.

Findings

01

Removing protected variables reduces racial disparities.

02

The method maintains predictive accuracy.

03

Omitting race alone does not eliminate bias.

Abstract

Predictive modeling is increasingly being employed to assist human decision-makers. One purported advantage of replacing human judgment with computer models in high stakes settings-- such as sentencing, hiring, policing, college admissions, and parole decisions-- is the perceived "neutrality" of computers. It is argued that because computer models do not hold personal prejudice, the predictions they produce will be equally free from prejudice. There is growing recognition that employing algorithms does not remove the potential for bias, and can even amplify it, since training data were inevitably generated by a process that is itself biased. In this paper, we provide a probabilistic definition of algorithmic bias. We propose a method to remove bias from predictive models by removing all information regarding protected variables from the permitted training data. Unlike previous work in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Criminal Justice and Corrections Analysis · Ethics and Social Impacts of AI