An Actor-Critic Contextual Bandit Algorithm for Personalized Mobile Health Interventions
Huitian Lei, Yangyi Lu, Ambuj Tewari, Susan A. Murphy

TL;DR
This paper introduces an actor-critic algorithm tailored for real-time personalized health interventions using mobile devices, addressing a methodological gap in constructing data-driven Just-In-Time Adaptive Interventions (JITAIs).
Contribution
The paper formulates real-time intervention tailoring as a contextual bandit problem that accounts for interpretability requirements, and develops an online actor-critic algorithm with proven asymptotic properties.
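To make the actor-critic contextual bandit idea concrete, here is a minimal Python sketch, not the authors' implementation. It assumes a binary action (deliver or withhold an intervention), a linear reward model fitted by ridge regression (the critic), and a logistic policy updated by one gradient step per decision point (the actor). The L2 penalty on the policy parameters, the feature map, the step size, and all function names are hypothetical choices made for illustration only.

```python
import numpy as np

def policy_prob(theta, s):
    """P(action = 1 | context s) under a logistic policy (an assumed form)."""
    return 1.0 / (1.0 + np.exp(-(s @ theta)))

def reward_features(s, a):
    """Hypothetical reward features: baseline terms plus action-by-context terms."""
    return np.concatenate([s, a * s])

def simulated_reward(s, a, rng):
    """Hypothetical environment: acting helps when the second context entry is positive."""
    return 2.0 * a * s[1] + 0.1 * rng.normal()

def run_actor_critic(contexts, reward_fn, lam=0.1, ridge=1.0, lr=0.5, seed=0):
    """Online loop: act, observe reward, update critic, then update actor."""
    rng = np.random.default_rng(seed)
    d = contexts.shape[1]
    theta = np.zeros(d)            # actor (policy) parameters
    A = ridge * np.eye(2 * d)      # ridge-regularized Gram matrix for the critic
    b = np.zeros(2 * d)
    mu = np.zeros(2 * d)           # critic (reward model) parameters
    seen = []
    for s in contexts:
        # Sample an action from the current stochastic policy.
        a = int(rng.random() < policy_prob(theta, s))
        r = reward_fn(s, a, rng)

        # Critic: ridge regression of observed rewards on reward features.
        f = reward_features(s, a)
        A += np.outer(f, f)
        b += r * f
        mu = np.linalg.solve(A, b)

        # Actor: one gradient-ascent step on the estimated average reward
        # over all contexts seen so far, minus an assumed L2 penalty.
        seen.append(s)
        S = np.asarray(seen)
        p = policy_prob(theta, S)
        advantage = S @ mu[d:]     # estimated reward(a=1) minus reward(a=0)
        grad = (S * (advantage * p * (1.0 - p))[:, None]).mean(axis=0)
        theta = theta + lr * (grad - 2.0 * lam * theta)
    return theta, mu
```

Calling `run_actor_critic(contexts, simulated_reward)` with a context matrix whose first column is an intercept returns the learned policy parameters and the critic's reward-model estimate. Because the policy is a low-dimensional logistic function of the context, its coefficients can be read directly, which is one simple way to keep the learned intervention rule interpretable.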
Findings
The algorithm effectively guides JITAI construction and refinement.
Numerical experiments demonstrate robustness under various assumptions.
Asymptotic properties are theoretically established and empirically validated.
Abstract
Increasing technological sophistication and widespread use of smartphones and wearable devices provide opportunities for innovative and highly personalized health interventions. A Just-In-Time Adaptive Intervention (JITAI) uses real-time data collection and communication capabilities of modern mobile devices to deliver interventions in real-time that are adapted to the in-the-moment needs of the user. The lack of methodological guidance in constructing data-based JITAIs remains a hurdle in advancing JITAI research despite the increasing popularity of JITAIs among clinical scientists. In this article, we make a first attempt to bridge this methodological gap by formulating the task of tailoring interventions in real-time as a contextual bandit problem. Interpretability requirements in the domain of mobile health lead us to formulate the problem differently from existing formulations…
Taxonomy
Topics: Advanced Bandit Algorithms Research · Smart Grid Energy Management · Advanced Wireless Network Optimization
Methods: Interpretability
