Bayesian Non-stationary Linear Bandits for Large-Scale Recommender   Systems

Saeed Ghoorchian; Evgenii Kortukov; Setareh Maghsudi

arXiv:2202.03167·cs.LG·July 26, 2023

Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems

Saeed Ghoorchian, Evgenii Kortukov, Setareh Maghsudi

PDF

Open Access 1 Repo

TL;DR

This paper introduces a scalable, non-stationary linear bandit algorithm using random projections and exponential weighting, improving recommendation accuracy in high-dimensional, dynamic environments.

Contribution

It proposes a novel Thompson sampling-based policy for high-dimensional, non-stationary bandits that reduces computational complexity while maintaining low regret.

Findings

01

The algorithm achieves lower regret compared to state-of-the-art methods.

02

It effectively handles high-dimensional, non-stationary data in real-time.

03

Numerical experiments on real datasets validate its efficiency and robustness.

Abstract

Taking advantage of contextual information can potentially boost the performance of recommender systems. In the era of big data, such side information often has several dimensions. Thus, developing decision-making algorithms to cope with such a high-dimensional context in real time is essential. That is specifically challenging when the decision-maker has a variety of items to recommend. In addition, changes in items' popularity or users' preferences can hinder the performance of the deployed recommender system due to a lack of robustness to distribution shifts in the environment. In this paper, we build upon the linear contextual multi-armed bandit framework to address this problem. We develop a decision-making policy for a linear bandit problem with high-dimensional feature vectors, a large set of arms, and non-stationary reward-generating processes. Our Thompson sampling-based policy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

saeedghoorchian/d-lints-rp
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Recommender Systems and Techniques