Learning to Rank under Multinomial Logit Choice

James A. Grant; David S. Leslie

arXiv:2009.03207·cs.LG·May 12, 2023

Learning to Rank under Multinomial Logit Choice

James A. Grant, David S. Leslie

PDF

Open Access

TL;DR

This paper introduces a multinomial logit choice model into the learning to rank framework, enabling more realistic modeling of user click behavior and proposing algorithms with theoretical regret bounds.

Contribution

It extends the LTR framework by incorporating the MNL choice model and develops UCB algorithms with proven regret bounds for both known and unknown parameters.

Findings

01

Proposed UCB algorithms achieve near-optimal regret bounds.

02

Theoretical analysis includes a lower bound of Ω(√JT) for regret.

03

New concentration results for Geometric variables and functional inequalities for MLEs.

Abstract

Learning the optimal ordering of content is an important challenge in website design. The learning to rank (LTR) framework models this problem as a sequential problem of selecting lists of content and observing where users decide to click. Most previous work on LTR assumes that the user considers each item in the list in isolation, and makes binary choices to click or not on each. We introduce a multinomial logit (MNL) choice model to the LTR framework, which captures the behaviour of users who consider the ordered list of items as a whole and make a single choice among all the items and a no-click option. Under the MNL model, the user favours items which are either inherently more attractive, or placed in a preferable position within the list. We propose upper confidence bound (UCB) algorithms to minimise regret in two settings - where the position dependent parameters are known, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems