TL;DR
This paper highlights the importance of user state representation in recommender systems using contextual bandits, showing that embedding quality significantly impacts performance and varies across datasets.
Contribution
It systematically evaluates how different embedding-based user state representations affect bandit algorithm performance, revealing the critical role of state construction.
Findings
Variations in user state embeddings can outperform changes in bandit algorithms.
No single embedding strategy is best across all datasets.
Emphasizes the need for domain-specific evaluation of state representations.
Abstract
With the increasing availability of online information, recommender systems have become an important tool for many web-based systems. Due to the continuous aspect of recommendation environments, these systems increasingly rely on contextual multi-armed bandits (CMAB) to deliver personalized and real-time suggestions. A critical yet underexplored component in these systems is the representation of user state, which typically encapsulates the user's interaction history and is deeply correlated with the model's decisions and learning. In this paper, we investigate the impact of different embedding-based state representations derived from matrix factorization models on the performance of traditional CMAB algorithms. Our large-scale experiments reveal that variations in state representation can lead to improvements greater than those achieved by changing the bandit algorithm itself.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
