Inherent Trade-Offs in the Fair Determination of Risk Scores
Jon Kleinberg, Sendhil Mullainathan, Manish Raghavan

TL;DR
This paper formalizes three core fairness conditions in algorithmic risk scoring, proving their mutual incompatibility in most cases and highlighting the inherent trade-offs in achieving fairness.
Contribution
It introduces a formal framework for understanding the fundamental trade-offs between different fairness notions in risk scores.
Findings
No method can satisfy all three fairness conditions simultaneously in general.
Achieving approximate fairness requires data to be in special constrained cases.
The results clarify the incompatibility of key fairness notions in algorithmic classification.
Abstract
Recent discussion in the public sphere about algorithmic classification has involved tension between competing notions of what it means for a probabilistic classification to be fair to different groups. We formalize three fairness conditions that lie at the heart of these debates, and we prove that except in highly constrained special cases, there is no method that can satisfy these three conditions simultaneously. Moreover, even satisfying all three conditions approximately requires that the data lie in an approximate version of one of the constrained special cases identified by our theorem. These results suggest some of the ways in which key notions of fairness are incompatible with each other, and hence provide a framework for thinking about the trade-offs between them.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Philosophy and History of Science · Explainable Artificial Intelligence (XAI)
