Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems
Saeed Ghoorchian, Evgenii Kortukov, Setareh Maghsudi

TL;DR
This paper introduces a scalable, non-stationary linear bandit algorithm using random projections and exponential weighting, improving recommendation accuracy in high-dimensional, dynamic environments.
Contribution
It proposes a novel Thompson sampling-based policy for high-dimensional, non-stationary bandits that reduces computational complexity while maintaining low regret.
Findings
The algorithm achieves lower regret compared to state-of-the-art methods.
It effectively handles high-dimensional, non-stationary data in real-time.
Numerical experiments on real datasets validate its efficiency and robustness.
Abstract
Taking advantage of contextual information can potentially boost the performance of recommender systems. In the era of big data, such side information often has several dimensions. Thus, developing decision-making algorithms to cope with such a high-dimensional context in real time is essential. That is specifically challenging when the decision-maker has a variety of items to recommend. In addition, changes in items' popularity or users' preferences can hinder the performance of the deployed recommender system due to a lack of robustness to distribution shifts in the environment. In this paper, we build upon the linear contextual multi-armed bandit framework to address this problem. We develop a decision-making policy for a linear bandit problem with high-dimensional feature vectors, a large set of arms, and non-stationary reward-generating processes. Our Thompson sampling-based policy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Recommender Systems and Techniques
