TL;DR
This paper introduces AICM, an adversarial imitation learning framework for click models in information retrieval, which improves modeling of user behavior, reduces exposure bias, and outperforms existing methods on web search data.
Contribution
The paper proposes a novel adversarial imitation learning approach for click models, explicitly learning user utility, modeling interactions dynamically, and reducing exposure bias.
Findings
AICM outperforms state-of-the-art models in traditional click metrics.
AICM reduces exposure bias from quadratic to linear in sequence length.
AICM effectively recovers underlying click sequence patterns.
Abstract
Modern information retrieval systems, including web search, ads placement, and recommender systems, typically rely on learning from user feedback. Click models, which study how users interact with a ranked list of items, provide a useful understanding of user feedback for learning ranking models. Constructing "right" dependencies is the key of any successful click model. However, probabilistic graphical models (PGMs) have to rely on manually assigned dependencies, and oversimplify user behaviors. Existing neural network based methods promote PGMs by enhancing the expressive ability and allowing flexible dependencies, but still suffer from exposure bias and inferior estimation. In this paper, we propose a novel framework, Adversarial Imitation Click Model (AICM), based on imitation learning. Firstly, we explicitly learn the reward function that recovers users' intrinsic utility and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
