To Model or to Intervene: A Comparison of Counterfactual and Online Learning to Rank from User Interactions
Rolf Jagerman, Harrie Oosterhuis, Maarten de Rijke

TL;DR
This paper compares counterfactual and online learning to rank methods, revealing that their effectiveness depends on bias and noise levels, guiding practitioners in choosing the appropriate approach for different scenarios.
Contribution
First benchmarking study comparing counterfactual and online LTR methods under various conditions, highlighting their strengths and limitations.
Findings
Counterfactual methods excel with low bias and noise.
Online methods are robust to bias and noise.
Choice of method depends on bias, position bias, and interaction noise levels.
Abstract
Learning to Rank (LTR) from user interactions is challenging as user feedback often contains high levels of bias and noise. At the moment, two methodologies for dealing with bias prevail in the field of LTR: counterfactual methods that learn from historical data and model user behavior to deal with biases; and online methods that perform interventions to deal with bias but use no explicit user models. For practitioners the decision between either methodology is very important because of its direct impact on end users. Nevertheless, there has never been a direct comparison between these two approaches to unbiased LTR. In this study we provide the first benchmarking of both counterfactual and online LTR methods under different experimental conditions. Our results show that the choice between the methodologies is consequential and depends on the presence of selection bias, and the degree…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Information Retrieval and Search Behavior · Economic and Environmental Valuation
