The Bandit's Blind Spot: The Critical Role of User State Representation in Recommender Systems

Pedro R. Pires; Gregorio F. Azevedo; Rafael T. Sereicikas; Pietro L. Campos; Tiago A. Almeida

arXiv:2604.26651·cs.IR·April 30, 2026

The Bandit's Blind Spot: The Critical Role of User State Representation in Recommender Systems

Pedro R. Pires, Gregorio F. Azevedo, Rafael T. Sereicikas, Pietro L. Campos, Tiago A. Almeida

PDF

1 Repo

TL;DR

This paper highlights the importance of user state representation in recommender systems using contextual bandits, showing that embedding quality significantly impacts performance and varies across datasets.

Contribution

It systematically evaluates how different embedding-based user state representations affect bandit algorithm performance, revealing the critical role of state construction.

Findings

01

Variations in user state embeddings can outperform changes in bandit algorithms.

02

No single embedding strategy is best across all datasets.

03

Emphasizes the need for domain-specific evaluation of state representations.

Abstract

With the increasing availability of online information, recommender systems have become an important tool for many web-based systems. Due to the continuous aspect of recommendation environments, these systems increasingly rely on contextual multi-armed bandits (CMAB) to deliver personalized and real-time suggestions. A critical yet underexplored component in these systems is the representation of user state, which typically encapsulates the user's interaction history and is deeply correlated with the model's decisions and learning. In this paper, we investigate the impact of different embedding-based state representations derived from matrix factorization models on the performance of traditional CMAB algorithms. Our large-scale experiments reveal that variations in state representation can lead to improvements greater than those achieved by changing the bandit algorithm itself.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

UFSCar-LaSID/bandits_blind_spot
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.