Leave-one-out Unfairness
Emily Black, Matt Fredrikson

TL;DR
This paper introduces leave-one-out unfairness, a measure of how individual predictions change with slight modifications to training data, highlighting its implications for fairness, robustness, and model behavior.
Contribution
It formalizes leave-one-out unfairness, analyzes its presence in deep models, and explores how adversarial training and smoothing techniques influence this fairness aspect.
Findings
Deep models exhibit leave-one-out unfairness even with low generalization error.
Adversarial training increases leave-one-out unfairness, while randomized smoothing decreases it.
Leave-one-out unfairness impacts practical applications requiring individual fairness.
Abstract
We introduce leave-one-out unfairness, which characterizes how likely a model's prediction for an individual will change due to the inclusion or removal of a single other person in the model's training data. Leave-one-out unfairness appeals to the idea that fair decisions are not arbitrary: they should not be based on the chance event of any one person's inclusion in the training data. Leave-one-out unfairness is closely related to algorithmic stability, but it focuses on the consistency of an individual point's prediction outcome over unit changes to the training data, rather than the error of the model in aggregate. Beyond formalizing leave-one-out unfairness, we characterize the extent to which deep models behave leave-one-out unfairly on real data, including in cases where the generalization error is small. Further, we demonstrate that adversarial training and randomized smoothing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI · Adversarial Robustness in Machine Learning
MethodsRandomized Smoothing
