Modeling Attrition in Recommender Systems with Departing Bandits

Omer Ben-Porat; Lee Cohen; Liu Leqi; Zachary C. Lipton; Yishay Mansour

arXiv:2203.13423·cs.LG·February 19, 2024·1 cites

Modeling Attrition in Recommender Systems with Departing Bandits

Omer Ben-Porat, Lee Cohen, Liu Leqi, Zachary C. Lipton, Yishay Mansour

PDF

Open Access 1 Video

TL;DR

This paper introduces a new bandit model for recommender systems that accounts for user departure due to dissatisfaction, providing algorithms with optimal or near-optimal regret bounds for different user type scenarios.

Contribution

It proposes a novel bandit framework incorporating user departure, and develops efficient algorithms with provable regret bounds for single and multiple user types.

Findings

01

Optimal UCB-based algorithm for single user type.

02

Efficient learning algorithm for two user types with $ ilde{O}( oot{2}T)$ regret.

03

Addresses policy-dependent horizons in recommender systems.

Abstract

Traditionally, when recommender systems are formalized as multi-armed bandits, the policy of the recommender system influences the rewards accrued, but not the length of interaction. However, in real-world systems, dissatisfied users may depart (and never come back). In this work, we propose a novel multi-armed bandit setup that captures such policy-dependent horizons. Our setup consists of a finite set of user types, and multiple arms with Bernoulli payoffs. Each (user type, arm) tuple corresponds to an (unknown) reward probability. Each user's type is initially unknown and can only be inferred through their response to recommendations. Moreover, if a user is dissatisfied with their recommendation, they might depart the system. We first address the case where all users share the same type, demonstrating that a recent UCB-based algorithm is optimal. We then move forward to the more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Modeling Attrition in Recommender Systems with Departing Bandits· underline

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Recommender Systems and Techniques