Regret in Online Recommendation Systems

Kaito Ariu; Narae Ryu; Se-Young Yun; Alexandre Prouti\`ere

arXiv:2010.12363·stat.ML·October 26, 2020·1 cites

Regret in Online Recommendation Systems

Kaito Ariu, Narae Ryu, Se-Young Yun, Alexandre Prouti\`ere

PDF

Open Access 1 Video

TL;DR

This paper provides a theoretical framework for analyzing regret in online recommendation systems, considering user-item preferences, structural assumptions, and the impact of constraints like not recommending the same item twice to a user.

Contribution

It introduces a comprehensive regret analysis for online recommendation algorithms under various structural assumptions and constraints, deriving lower bounds and optimal algorithms.

Findings

01

Regret lower bounds are established for different structural assumptions.

02

Algorithms are devised that achieve these regret bounds.

03

The analysis highlights the impact of constraints and structure learning on regret.

Abstract

This paper proposes a theoretical analysis of recommendation systems in an online setting, where items are sequentially recommended to users over time. In each round, a user, randomly picked from a population of $m$ users, requests a recommendation. The decision-maker observes the user and selects an item from a catalogue of $n$ items. Importantly, an item cannot be recommended twice to the same user. The probabilities that a user likes each item are unknown. The performance of the recommendation algorithm is captured through its regret, considering as a reference an Oracle algorithm aware of these probabilities. We investigate various structural assumptions on these probabilities: we derive for each structure regret lower bounds, and devise algorithms achieving these limits. Interestingly, our analysis reveals the relative weights of the different components of regret: the component…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Regret in Online Recommendation Systems· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Recommender Systems and Techniques