Power Constrained Nonstationary Bandits with Habituation and Recovery Dynamics

Fengxu Li; Stephanie M. Carpenter; Matthew P. Buman; Yonatan Mintz

arXiv:2511.02944·cs.LG·November 6, 2025

Power Constrained Nonstationary Bandits with Habituation and Recovery Dynamics

Fengxu Li, Stephanie M. Carpenter, Matthew P. Buman, Yonatan Mintz

PDF

Open Access

TL;DR

This paper introduces ROGUE-TS, a Thompson Sampling algorithm for nonstationary bandits with habituation and recovery, providing theoretical guarantees and practical methods to balance personalized learning with population-level effects.

Contribution

The paper develops a novel Thompson Sampling algorithm for the ROGUE bandit framework and introduces a probability clipping method to balance exploration and exploitation in nonstationary settings.

Findings

01

Lower regret compared to existing methods.

02

Maintains high statistical power in MRT datasets.

03

Effectively balances personalization and population-level learning.

Abstract

A common challenge for decision makers is selecting actions whose rewards are unknown and evolve over time based on prior policies. For instance, repeated use may reduce an action's effectiveness (habituation), while inactivity may restore it (recovery). These nonstationarities are captured by the Reducing or Gaining Unknown Efficacy (ROGUE) bandit framework, which models real-world settings such as behavioral health interventions. While existing algorithms can compute sublinear regret policies to optimize these settings, they may not provide sufficient exploration due to overemphasis on exploitation, limiting the ability to estimate population-level effects. This is a challenge of particular interest in micro-randomized trials (MRTs) that aid researchers in developing just-in-time adaptive interventions that have population-level effects while still providing personalized…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Advanced Causal Inference Techniques · Digital Mental Health Interventions