Ranking In Generalized Linear Bandits

Amitis Shidani; George Deligiannidis; Arnaud Doucet

arXiv:2207.00109·stat.ML·January 3, 2024

Ranking In Generalized Linear Bandits

Amitis Shidani, George Deligiannidis, Arnaud Doucet

PDF

Open Access 1 Repo

TL;DR

This paper introduces algorithms for ranking in generalized linear bandits, accounting for position and item dependencies, with applications in recommendation systems, and extends existing models to more complex scenarios.

Contribution

It develops UCB and Thompson Sampling algorithms for ranking with position and item dependencies, generalizing previous models and linking the problem to graph theory.

Findings

01

Algorithms effectively handle position and item dependencies.

02

Generalizes existing ranking models to complex scenarios.

03

Connects ranking problem to graph theory for broader analysis.

Abstract

We study the ranking problem in generalized linear bandits. At each time, the learning agent selects an ordered list of items and observes stochastic outcomes. In recommendation systems, displaying an ordered list of the most attractive items is not always optimal as both position and item dependencies result in a complex reward function. A very naive example is the lack of diversity when all the most attractive items are from the same category. We model the position and item dependencies in the ordered list and design UCB and Thompson Sampling type algorithms for this problem. Our work generalizes existing studies in several directions, including position dependencies where position discount is a particular case, and connecting the ranking problem to graph theory.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shidani/rankingcontextualbandits
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems