Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms
Khashayar Khosravi, Renato Paes Leme, Chara Podimata, and Apostolis, Tsorvantzis

TL;DR
This paper introduces a new bandit model that accounts for deterministically evolving and unobservable states, improving learning strategies for recommendation systems and online advertising where system health and user preferences change over time.
Contribution
It proposes a novel bandit framework with evolving states, providing algorithms that adapt to different evolution rates and are robust to model misspecifications.
Findings
Algorithms perform well across various state evolution rates.
The model generalizes standard multi-armed bandits.
Robustness to model misspecification is demonstrated.
Abstract
We propose a model for learning with bandit feedback while accounting for deterministically evolving and unobservable states that we call Bandits with Deterministically Evolving States (-). The workhorse applications of our model are learning for recommendation systems and learning for online ads. In both cases, the reward that the algorithm obtains at each round is a function of the short-term reward of the action chosen and how "healthy" the system is (i.e., as measured by its state). For example, in recommendation systems, the reward that the platform obtains from a user's engagement with a particular type of content depends not only on the inherent features of the specific content, but also on how the user's preferences have evolved as a result of interacting with other types of content on the platform. Our general model accounts for the different rate at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Smart Grid Energy Management
