TL;DR
This paper introduces a query-level click propensity model and a DualIPW method to improve unbiased learning-to-rank by addressing relevance saturation bias and position bias, demonstrating significant performance gains on real-world data.
Contribution
It proposes a novel query-level click propensity model and a DualIPW mechanism that jointly address relevance saturation and position bias in unbiased learning-to-rank.
Findings
DualIPW outperforms existing ULTR methods on Baidu-ULTR dataset.
Theoretical proof shows DualIPW provides unbiased ranking estimates.
Experiments demonstrate robustness to relevance saturation bias.
Abstract
Most existing unbiased learning-to-rank (ULTR) approaches are based on the user examination hypothesis, which assumes that users will click a result only if it is both relevant and observed (typically modeled by position). However, in real-world scenarios, users often click only one or two results after examining multiple relevant options, due to limited patience or because their information needs have already been satisfied. Motivated by this, we propose a query-level click propensity model to capture the probability that users will click on different result lists, allowing for non-zero probabilities that users may not click on an observed relevant result. We hypothesize that this propensity increases when more potentially relevant results are present, and refer to this user behavior as relevance saturation bias. Our method introduces a Dual Inverse Propensity Weighting (DualIPW)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
