TL;DR
This paper proposes an uncertainty-aware, post hoc bias mitigation method for ranking models that leverages Bayesian uncertainty estimates to improve fairness without sacrificing utility, outperforming existing baselines.
Contribution
The work introduces a novel post hoc bias mitigation technique using ranking score uncertainty, applicable to arbitrary models, with better utility-fairness trade-offs and no additional training.
Findings
Uncertainty-based method outperforms baselines in fairness-utility trade-off.
The approach is flexible and can be applied post hoc to existing models.
It requires no additional training, reducing computational costs.
Abstract
Societal biases that are contained in retrieved documents have received increased interest. Such biases, which are often prevalent in the training data and learned by the model, can cause societal harms, by misrepresenting certain groups, and by enforcing stereotypes. Mitigating such biases demands algorithms that balance the trade-off between maximized utility for the user with fairness objectives, which incentivize unbiased rankings. Prior work on bias mitigation often assumes that ranking scores, which correspond to the utility that a document holds for a user, can be accurately determined. In reality, there is always a degree of uncertainty in the estimate of expected document utility. This uncertainty can be approximated by viewing ranking models through a Bayesian perspective, where the standard deterministic score becomes a distribution. In this work, we investigate whether…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsHigh-Order Consensuses
