Post-hoc Bias Scoring Is Optimal For Fair Classification
Wenlong Chen, Yegor Klochkov, Yang Liu

TL;DR
This paper introduces a simple post-hoc bias scoring method for fair classification that optimally adjusts classifiers under various group fairness constraints, maintaining high accuracy.
Contribution
It provides an explicit characterization of the Bayes optimal classifier under fairness constraints and introduces a bias score for effective post-hoc adjustments.
Findings
Bias score enables optimal fairness adjustments
Thresholding bias scores achieves fairness-accuracy trade-offs
Method applies to multiple fairness criteria and sensitive attributes
Abstract
We consider a binary classification problem under group fairness constraints, which can be one of Demographic Parity (DP), Equalized Opportunity (EOp), or Equalized Odds (EO). We propose an explicit characterization of Bayes optimal classifier under the fairness constraints, which turns out to be a simple modification rule of the unconstrained classifier. Namely, we introduce a novel instance-level measure of bias, which we call bias score, and the modification rule is a simple linear rule on top of the finite amount of bias scores.Based on this characterization, we develop a post-hoc approach that allows us to adapt to fairness constraints while maintaining high accuracy. In the case of DP and EOp constraints, the modification rule is thresholding a single bias score, while in the case of EO constraints we are required to fit a linear modification rule with 2 parameters. The method can…
Peer Reviews
Decision·ICLR 2024 spotlight
- The paper proposes a novel way to modify the output of the unconstrained Bayes optimal classifier post-hoc in order to satisfy fairness constraints. While this approach has been previously studied, I believe the instance-level bias scores are novel. - The main significant technical contribution is the ability to satisfy Equalized Odds fairness constraints. - Besides these, the characterization of the optimal modification rule in Theorem 1, which has the form of a linear combination of bias sco
- No major weakness apart a few issues with the writing and minor typos that can be fixed with a revision.
-A key challenge in fair learning is access to sensitive attributes. This paper acknowledges the fact that sensitive attributes may not be accessible in practice, and therefore, proposes a post-processing algorithm that does not require such access. To the best of my knowledge, the proposed method is original and has significant impact. -The authors accompany their theoretical guarantees with an extensive experimental analysis to show the efficacy of their algorithm. -The paper is well-written
-While the proposed method does not require access to sensitive attributes at test time, it still requires the conditional distribution of sensitive attributes $P(A|X)$, or a good estimate of it. It is not clear if this complies with laws and regulations: a company can still use their model of $P(A|X)$ to get good estimates of individual’s sensitive attribute. I’d like to see a discussion of this in the paper as well. -Overall, the assumption that we have access to $P(Y, A|X)$, or a good estima
- Novel characterization of the optimal fairness-constrained classifier as a "simple" modification rule over the Bayes optimal classifier. - Group-specific thresholding (Hardt et al., 2016) is a specific case of this rule where the sensitive attribute data is known at inference time; proving that simple thresholding is optimal for DP and EO when this information is known (with Bayes optimal scores). - Specific examples given for DP, EO, and equalized odds. - The proposed method does not need
- No code or results files are provided for the experiments; neither an implementation for the proposed method. This is largest point against the current version of the paper, as properly reviewing the work required checking some experimental details. - Given that postprocessing baselines achieve Pareto dominant results in Fig. 2 (expectedly, as they have access to the sensitive attribute at inference time), it would be interesting to add partially relaxed results for these baselines for a more
Code & Models
Videos
Taxonomy
TopicsEthics and Social Impacts of AI
