Simplicity Creates Inequity: Implications for Fairness, Stereotypes, and Interpretability
Jon Kleinberg, Sendhil Mullainathan

TL;DR
This paper demonstrates that simplicity in prediction models can lead to increased inequity and bias, highlighting a fundamental trade-off between interpretability and fairness in decision-making algorithms.
Contribution
The paper introduces a formal framework showing that simple prediction functions are inherently improvable and can induce biases against disadvantaged groups, revealing a fundamental conflict between simplicity and equity.
Findings
Simple prediction functions are strictly improvable for better efficiency and fairness.
Simplicity reduces overall welfare and harms disadvantaged groups.
Simplification creates incentives for biased use of group membership information.
Abstract
Algorithms are increasingly used to aid, or in some cases supplant, human decision-making, particularly for decisions that hinge on predictions. As a result, two additional features in addition to prediction quality have generated interest: (i) to facilitate human interaction and understanding with these algorithms, we desire prediction functions that are in some fashion simple or interpretable; and (ii) because they influence consequential decisions, we also want them to produce equitable allocations. We develop a formal model to explore the relationship between the demands of simplicity and equity. Although the two concepts appear to be motivated by qualitatively distinct goals, we show a fundamental inconsistency between them. Specifically, we formalize a general framework for producing simple prediction functions, and in this framework we establish two basic results. First, every…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
