Counterfactual Fairness by Combining Factual and Counterfactual   Predictions

Zeyu Zhou; Tianci Liu; Ruqi Bai; Jing Gao; Murat Kocaoglu; David I.; Inouye

arXiv:2409.01977·cs.LG·January 23, 2025

Counterfactual Fairness by Combining Factual and Counterfactual Predictions

Zeyu Zhou, Tianci Liu, Ruqi Bai, Jing Gao, Murat Kocaoglu, David I., Inouye

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates the trade-off between counterfactual fairness and predictive accuracy in machine learning models, proposing methods to achieve fairness without sacrificing optimality and analyzing their performance under causal knowledge constraints.

Contribution

It introduces a theoretical framework for understanding the fairness-performance trade-off and proposes a practical algorithm to enforce counterfactual fairness with limited causal information.

Findings

01

Theoretical quantification of the fairness-performance trade-off.

02

A method to convert optimal predictors into fair ones without losing optimality.

03

Experimental validation on synthetic datasets supports the analysis.

Abstract

In high-stake domains such as healthcare and hiring, the role of machine learning (ML) in decision-making raises significant fairness concerns. This work focuses on Counterfactual Fairness (CF), which posits that an ML model's outcome on any individual should remain unchanged if they had belonged to a different demographic group. Previous works have proposed methods that guarantee CF. Notwithstanding, their effects on the model's predictive performance remains largely unclear. To fill in this gap, we provide a theoretical study on the inherent trade-off between CF and predictive performance in a model-agnostic manner. We first propose a simple but effective method to cast an optimal but potentially unfair predictor into a fair one without losing the optimality. By analyzing its excess risk in order to achieve CF, we quantify this inherent trade-off. Further analysis on our method's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

inouye-lab/pcf
pytorchOfficial

Videos

Counterfactual Fairness by Combining Factual and Counterfactual Predictions· slideslive

Taxonomy

TopicsEthics and Social Impacts of AI · Psychology of Moral and Emotional Judgment