Rawlsian Fair Adaptation of Deep Learning Classifiers
Kulin Shah, Pooja Gupta, Amit Deshpande, Chiranjib Bhattacharyya

TL;DR
This paper introduces a Rawlsian approach to group fairness in classification, focusing on minimizing error for the worst-off group, and provides practical algorithms for fair adaptation of black-box models without retraining.
Contribution
It proposes a novel Rawlsian fairness criterion for classifiers, and develops efficient algorithms for adapting existing deep models to meet this fairness without retraining.
Findings
Significant reduction in worst-group error rates.
Outperforms state-of-the-art group-fair algorithms.
Applicable to black-box models without retraining.
Abstract
Group-fairness in classification aims for equality of a predictive utility across different sensitive sub-populations, e.g., race or gender. Equality or near-equality constraints in group-fairness often worsen not only the aggregate utility but also the utility for the least advantaged sub-population. In this paper, we apply the principles of Pareto-efficiency and least-difference to the utility being accuracy, as an illustrative example, and arrive at the Rawls classifier that minimizes the error rate on the worst-off sensitive sub-population. Our mathematical characterization shows that the Rawls classifier uniformly applies a threshold to an ideal score of features, in the spirit of fair equality of opportunity. In practice, such a score or a feature representation is often computed by a black-box model that has been useful but unfair. Our second contribution is practical Rawlsian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
