Inherent Tradeoffs in Learning Fair Representations
Han Zhao, Geoffrey J. Gordon

TL;DR
This paper establishes fundamental limits on the tradeoff between fairness and accuracy in classification, showing that achieving statistical parity can inherently cause errors, especially when group base rates differ, and provides algorithms to optimize this balance.
Contribution
The paper characterizes an inherent tradeoff between fairness and accuracy, providing lower bounds, tight algorithms, and efficient computation methods for fair classification and representations.
Findings
Any fair classifier with statistical parity incurs large errors if group base rates differ.
The lower bounds on joint error are tight and can be computed via linear programming.
Experimental results confirm the theoretical tradeoffs and bounds.
Abstract
Real-world applications of machine learning tools in high-stakes domains are often regulated to be fair, in the sense that the predicted target should satisfy some quantitative notion of parity with respect to a protected attribute. However, the exact tradeoff between fairness and accuracy is not entirely clear, even for the basic paradigm of classification problems. In this paper, we characterize an inherent tradeoff between statistical parity and accuracy in the classification setting by providing a lower bound on the sum of group-wise errors of any fair classifiers. Our impossibility theorem could be interpreted as a certain uncertainty principle in fairness: if the base rates differ among groups, then any fair classifier satisfying statistical parity has to incur a large error on at least one of the groups. We further extend this result to give a lower bound on the joint error of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Privacy-Preserving Technologies in Data
