Post-hoc Bias Scoring Is Optimal For Fair Classification

Wenlong Chen; Yegor Klochkov; Yang Liu

arXiv:2310.05725·stat.ML·March 18, 2024·2 cites

Post-hoc Bias Scoring Is Optimal For Fair Classification

Wenlong Chen, Yegor Klochkov, Yang Liu

PDF

Open Access 1 Repo 1 Video 3 Reviews

TL;DR

This paper introduces a simple post-hoc bias scoring method for fair classification that optimally adjusts classifiers under various group fairness constraints, maintaining high accuracy.

Contribution

It provides an explicit characterization of the Bayes optimal classifier under fairness constraints and introduces a bias score for effective post-hoc adjustments.

Findings

01

Bias score enables optimal fairness adjustments

02

Thresholding bias scores achieves fairness-accuracy trade-offs

03

Method applies to multiple fairness criteria and sensitive attributes

Abstract

We consider a binary classification problem under group fairness constraints, which can be one of Demographic Parity (DP), Equalized Opportunity (EOp), or Equalized Odds (EO). We propose an explicit characterization of Bayes optimal classifier under the fairness constraints, which turns out to be a simple modification rule of the unconstrained classifier. Namely, we introduce a novel instance-level measure of bias, which we call bias score, and the modification rule is a simple linear rule on top of the finite amount of bias scores.Based on this characterization, we develop a post-hoc approach that allows us to adapt to fairness constraints while maintaining high accuracy. In the case of DP and EOp constraints, the modification rule is thresholding a single bias score, while in the case of EO constraints we are required to fit a linear modification rule with 2 parameters. The method can…

Peer Reviews

Decision·ICLR 2024 spotlight

Reviewer 01Rating 8· accept, good paperConfidence 4

Strengths

- The paper proposes a novel way to modify the output of the unconstrained Bayes optimal classifier post-hoc in order to satisfy fairness constraints. While this approach has been previously studied, I believe the instance-level bias scores are novel. - The main significant technical contribution is the ability to satisfy Equalized Odds fairness constraints. - Besides these, the characterization of the optimal modification rule in Theorem 1, which has the form of a linear combination of bias sco

Weaknesses

- No major weakness apart a few issues with the writing and minor typos that can be fixed with a revision.

Reviewer 02Rating 8· accept, good paperConfidence 4

Strengths

-A key challenge in fair learning is access to sensitive attributes. This paper acknowledges the fact that sensitive attributes may not be accessible in practice, and therefore, proposes a post-processing algorithm that does not require such access. To the best of my knowledge, the proposed method is original and has significant impact. -The authors accompany their theoretical guarantees with an extensive experimental analysis to show the efficacy of their algorithm. -The paper is well-written

Weaknesses

-While the proposed method does not require access to sensitive attributes at test time, it still requires the conditional distribution of sensitive attributes $P(A|X)$, or a good estimate of it. It is not clear if this complies with laws and regulations: a company can still use their model of $P(A|X)$ to get good estimates of individual’s sensitive attribute. I’d like to see a discussion of this in the paper as well. -Overall, the assumption that we have access to $P(Y, A|X)$, or a good estima

Reviewer 03Rating 8· accept, good paperConfidence 3

Strengths

- Novel characterization of the optimal fairness-constrained classifier as a "simple" modification rule over the Bayes optimal classifier. - Group-specific thresholding (Hardt et al., 2016) is a specific case of this rule where the sensitive attribute data is known at inference time; proving that simple thresholding is optimal for DP and EO when this information is known (with Bayes optimal scores). - Specific examples given for DP, EO, and equalized odds. - The proposed method does not need

Weaknesses

- No code or results files are provided for the experiments; neither an implementation for the proposed method. This is largest point against the current version of the paper, as properly reviewing the work required checking some experimental details. - Given that postprocessing baselines achieve Pareto dominant results in Fig. 2 (expectedly, as they have access to the sensitive attribute at inference time), it would be interesting to add partially relaxed results for these baselines for a more

Code & Models

Repositories

chenw20/biasscore
pytorchOfficial

Videos

Post-hoc bias scoring is optimal for fair classification· slideslive

Taxonomy

TopicsEthics and Social Impacts of AI