FairSHAP: Preprocessing for Fairness Through Attribution-Based Data Augmentation

Lin Zhu; Yijun Bian; Lei You

arXiv:2505.11111·cs.LG·February 24, 2026

FairSHAP: Preprocessing for Fairness Through Attribution-Based Data Augmentation

Lin Zhu, Yijun Bian, Lei You

PDF

Open Access 1 Repo

TL;DR

FairSHAP is a transparent preprocessing method that uses Shapley value attribution to identify and modify fairness-critical data instances, improving fairness metrics while maintaining accuracy.

Contribution

It introduces a novel, interpretable data augmentation framework leveraging Shapley values to enhance fairness in machine learning models.

Findings

01

Significantly improves demographic parity and equality of opportunity

02

Achieves fairness with minimal data perturbation

03

Maintains or improves predictive performance

Abstract

Ensuring fairness in machine learning models is critical, particularly in high-stakes domains where biased decisions can lead to serious societal consequences. Existing preprocessing approaches generally lack transparent mechanisms for identifying which features or instances are responsible for unfairness. This obscures the rationale behind data modifications. We introduce FairSHAP, a novel pre-processing framework that leverages Shapley value attribution to improve both individual and group fairness. FairSHAP identifies fairness-critical instances in the training data using an interpretable measure of feature importance, and systematically modifies them through instance-level matching across sensitive groups. This process reduces discriminative risk - an individual fairness metric - while preserving data integrity and model accuracy. We demonstrate that FairSHAP significantly improves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

youlei202/fairshap
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning