TL;DR
This paper introduces a framework that decomposes data representations to improve fairness in machine learning models, balancing predictive accuracy and bias mitigation through subspace adjustments and influence analysis.
Contribution
It proposes a novel subspace decomposition method combined with influence functions to control fairness-utility trade-offs, advancing fairness techniques beyond prediction-level constraints.
Findings
Theoretical analysis of how shared subspaces affect error and fairness.
Influence functions quantify the impact of subspace adjustments on estimates.
Empirical results show improved fairness with maintained accuracy.
Abstract
Machine learning models have achieved widespread success but often inherit and amplify historical biases, resulting in unfair outcomes. Traditional fairness methods typically impose constraints at the prediction level, without addressing underlying biases in data representations. In this work, we propose a principled framework that adjusts data representations to balance predictive utility and fairness. Using sufficient dimension reduction, we decompose the feature space into target-relevant, sensitive, and shared components, and control the fairness-utility trade-off by selectively removing sensitive information. We provide a theoretical analysis of how prediction error and fairness gaps evolve as shared subspaces are added, and employ influence functions to quantify their effects on the asymptotic behavior of parameter estimates. Experiments on both synthetic and real-world datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
