Calibrating Explore-Exploit Trade-off for Fair Online Learning to Rank
Yiling Jia, Hongning Wang

TL;DR
This paper introduces a novel framework for fair online learning to rank by calibrating exploration and exploitation, ensuring group fairness without prior relevance knowledge, and demonstrates its effectiveness through theoretical guarantees and empirical results.
Contribution
It proposes a general fairness framework for OL2R that balances exploration and exploitation to achieve group fairness without needing prior relevance information.
Findings
The proposed method maintains fairness across groups during exploration.
The strategy introduces minimal regret distortion in OL2R.
Empirical results show improved fairness and relevance estimation.
Abstract
Online learning to rank (OL2R) has attracted great research interests in recent years, thanks to its advantages in avoiding expensive relevance labeling as required in offline supervised ranking model learning. Such a solution explores the unknowns (e.g., intentionally present selected results on top positions) to improve its relevance estimation. This however triggers concerns on its ranking fairness: different groups of items might receive differential treatments during the course of OL2R. But existing fair ranking solutions usually require the knowledge of result relevance or a performing ranker beforehand, which contradicts with the setting of OL2R and thus cannot be directly applied to guarantee fairness. In this work, we propose a general framework to achieve fairness defined by group exposure in OL2R. The key idea is to calibrate exploration and exploitation for fairness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Domain Adaptation and Few-Shot Learning · Advanced Graph Neural Networks
