TL;DR
This paper investigates how errors in inferring sensitive demographic attributes affect the fairness of learning-to-rank models, comparing strategies that use hidden, inferred, or no demographic data, and finds re-ranking methods are more robust to inference errors.
Contribution
It provides a systematic empirical analysis of the impact of demographic inference errors on fair learning-to-rank strategies, highlighting the robustness of re-ranking approaches.
Findings
Fairness-aware LTR methods can increase bias with inference noise.
Fair re-ranking strategies are more robust to demographic inference errors.
Inference errors significantly impact the fairness performance of LTR models.
Abstract
As learning-to-rank models are increasingly deployed for decision-making in areas with profound life implications, the FairML community has been developing fair learning-to-rank (LTR) models. These models rely on the availability of sensitive demographic features such as race or sex. However, in practice, regulatory obstacles and privacy concerns protect this data from collection and use. As a result, practitioners may either need to promote fairness despite the absence of these features or turn to demographic inference tools to attempt to infer them. Given that these tools are fallible, this paper aims to further understand how errors in demographic inference impact the fairness performance of popular fair LTR strategies. In which cases would it be better to keep such demographic attributes hidden from models versus infer them? We examine a spectrum of fair LTR strategies ranging from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
