BLOB : A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals
Otmane Sakhi, Stephen Bonner, David Rohde, Flavian Vasile

TL;DR
This paper introduces BLOB, a Bayesian probabilistic model that effectively combines organic browsing data and bandit feedback signals to enhance recommendation accuracy and learning efficiency.
Contribution
The paper presents a novel Bayesian model that integrates organic and bandit signals using variational auto-encoders, improving recommendation quality over existing methods.
Findings
Outperforms state-of-the-art organic recommendation algorithms.
Matches or exceeds bandit-based methods in various environments.
Efficiently leverages organic signals to learn from bandit feedback.
Abstract
A common task for recommender systems is to build a pro le of the interests of a user from items in their browsing history and later to recommend items to the user from the same catalog. The users' behavior consists of two parts: the sequence of items that they viewed without intervention (the organic part) and the sequences of items recommended to them and their outcome (the bandit part). In this paper, we propose Bayesian Latent Organic Bandit model (BLOB), a probabilistic approach to combine the 'or-ganic' and 'bandit' signals in order to improve the estimation of recommendation quality. The bandit signal is valuable as it gives direct feedback of recommendation performance, but the signal quality is very uneven, as it is highly concentrated on the recommendations deemed optimal by the past version of the recom-mender system. In contrast, the organic signal is typically strong and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
