Learning for Counterfactual Fairness from Observational Data
Jing Ma, Ruocheng Guo, Aidong Zhang, Jundong Li

TL;DR
This paper introduces CLAIRE, a novel framework for counterfactual fairness that learns from observational data without requiring explicit causal models, using data augmentation and invariant penalties to mitigate bias.
Contribution
It proposes a new method for counterfactual fairness that does not depend on known causal models, addressing a key challenge in real-world applications.
Findings
CLAIRE outperforms existing methods in counterfactual fairness.
The framework achieves competitive prediction accuracy.
Experimental results validate its effectiveness on synthetic and real data.
Abstract
Fairness-aware machine learning has attracted a surge of attention in many domains, such as online advertising, personalized recommendation, and social media analysis in web applications. Fairness-aware machine learning aims to eliminate biases of learning models against certain subgroups described by certain protected (sensitive) attributes such as race, gender, and age. Among many existing fairness notions, counterfactual fairness is a popular notion defined from a causal perspective. It measures the fairness of a predictor by comparing the prediction of each individual in the original world and that in the counterfactual worlds in which the value of the sensitive attribute is modified. A prerequisite for existing methods to achieve counterfactual fairness is the prior human knowledge of the causal model for the data. However, in real-world scenarios, the underlying causal model is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
