Fair Infinitesimal Jackknife: Mitigating the Influence of Biased Training Data Points Without Refitting
Prasanna Sattigeri, Soumya Ghosh, Inkit Padhi, Pierre Dognin, Kush R., Varshney

TL;DR
This paper introduces a method to enhance fairness in pre-trained classifiers by selectively removing training data points based on influence on fairness metrics, without needing to refit the model.
Contribution
It proposes a novel infinitesimal jackknife-based influence measure for data removal to improve fairness without sacrificing predictive accuracy.
Findings
Significant fairness improvements across multiple tasks
Minimal impact on predictive performance
Outperforms existing fairness intervention methods
Abstract
In consequential decision-making applications, mitigating unwanted biases in machine learning models that yield systematic disadvantage to members of groups delineated by sensitive attributes such as race and gender is one key intervention to strive for equity. Focusing on demographic parity and equality of opportunity, in this paper we propose an algorithm that improves the fairness of a pre-trained classifier by simply dropping carefully selected training data points. We select instances based on their influence on the fairness metric of interest, computed using an infinitesimal jackknife-based approach. The dropping of training points is done in principle, but in practice does not require the model to be refit. Crucially, we find that such an intervention does not substantially reduce the predictive performance of the model but drastically improves the fairness metric. Through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsEthics and Social Impacts of AI
