De-Biased Modelling of Search Click Behavior with Reinforcement Learning
Jianghong Zhou, Sayyed M. Zahiri, Simon Hughes, Khalifeh Al Jadda,, Surya Kallumadi, Eugene Agichtein

TL;DR
This paper introduces a reinforcement learning-based model called DRLC that reduces bias in search click data, leading to more accurate click prediction and improved search ranking performance.
Contribution
The paper proposes the De-biased Reinforcement Learning Click model (DRLC), which relaxes previous assumptions and effectively reduces bias in click logs using neural networks and reinforcement learning.
Findings
DRLC outperforms previous models in click prediction metrics.
DRLC improves ranking performance in web search.
The model demonstrates effective bias reduction in click data.
Abstract
Users' clicks on Web search results are one of the key signals for evaluating and improving web search quality and have been widely used as part of current state-of-the-art Learning-To-Rank(LTR) models. With a large volume of search logs available for major search engines, effective models of searcher click behavior have emerged to evaluate and train LTR models. However, when modeling the users' click behavior, considering the bias of the behavior is imperative. In particular, when a search result is not clicked, it is not necessarily chosen as not relevant by the user, but instead could have been simply missed, especially for lower-ranked results. These kinds of biases in the click log data can be incorporated into the click models, propagating the errors to the resulting LTR ranking models or evaluation metrics. In this paper, we propose the De-biased Reinforcement Learning Click…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
