Power Constrained Nonstationary Bandits with Habituation and Recovery Dynamics
Fengxu Li, Stephanie M. Carpenter, Matthew P. Buman, Yonatan Mintz

TL;DR
This paper introduces ROGUE-TS, a Thompson Sampling algorithm for nonstationary bandits with habituation and recovery, providing theoretical guarantees and practical methods to balance personalized learning with population-level effects.
Contribution
The paper develops a novel Thompson Sampling algorithm for the ROGUE bandit framework and introduces a probability clipping method to balance exploration and exploitation in nonstationary settings.
Findings
Lower regret compared to existing methods.
Maintains high statistical power in MRT datasets.
Effectively balances personalization and population-level learning.
Abstract
A common challenge for decision makers is selecting actions whose rewards are unknown and evolve over time based on prior policies. For instance, repeated use may reduce an action's effectiveness (habituation), while inactivity may restore it (recovery). These nonstationarities are captured by the Reducing or Gaining Unknown Efficacy (ROGUE) bandit framework, which models real-world settings such as behavioral health interventions. While existing algorithms can compute sublinear regret policies to optimize these settings, they may not provide sufficient exploration due to overemphasis on exploitation, limiting the ability to estimate population-level effects. This is a challenge of particular interest in micro-randomized trials (MRTs) that aid researchers in developing just-in-time adaptive interventions that have population-level effects while still providing personalized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Advanced Causal Inference Techniques · Digital Mental Health Interventions
