When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution
Chuanhao Li, Qingyun Wu, Hongning Wang

TL;DR
This paper introduces a collaborative dynamic bandit algorithm that models changing user preferences and dependencies over time, improving recommendation performance in non-stationary environments.
Contribution
It develops a novel Bayesian inference-based method that explicitly models environmental changes in user preferences and dependencies, maintaining sublinear regret in dynamic settings.
Findings
Maintains $ ilde O( extstyle rac{1}{ ext{sqrt}(T)}$) regret in changing environments.
Outperforms state-of-the-art solutions on synthetic and real datasets.
Highlights the importance of modeling environmental dynamics in collaborative bandit learning.
Abstract
Collaborative bandit learning, i.e., bandit algorithms that utilize collaborative filtering techniques to improve sample efficiency in online interactive recommendation, has attracted much research attention as it enjoys the best of both worlds. However, all existing collaborative bandit learning solutions impose a stationary assumption about the environment, i.e., both user preferences and the dependency among users are assumed static over time. Unfortunately, this assumption hardly holds in practice due to users' ever-changing interests and dependence relations, which inevitably costs a recommender system sub-optimal performance in practice. In this work, we develop a collaborative dynamic bandit solution to handle a changing environment for recommendation. We explicitly model the underlying changes in both user preferences and their dependency relation as a stochastic process.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques · Data Stream Mining Techniques
