Learning to Compress Time-to-Control: A Reinforcement Learning Framework for Chronic Disease Management

Prabhjot Singh; Abhishek Gupta; Chris Betz; Abe Flansburg; Brett Ives; Sudeep Lama; Jung Hoon Son

arXiv:2605.09818·cs.LG·May 12, 2026

Learning to Compress Time-to-Control: A Reinforcement Learning Framework for Chronic Disease Management

Prabhjot Singh, Abhishek Gupta, Chris Betz, Abe Flansburg, Brett Ives, Sudeep Lama, Jung Hoon Son

PDF

TL;DR

This paper introduces a reinforcement learning framework tailored for chronic disease management, focusing on optimizing time-to-control by leveraging structural properties of chronic care and integrating preference learning.

Contribution

It formalizes a novel RL approach that exploits chronic care properties, coupling preference learning with RL, and demonstrates improved performance in simulation for hypertension and diabetes.

Findings

01

Capability-weighted offline RL outperforms uniform-weighted RL by 15 percentage points on T2D TTC.

02

Uniform-weighted RL underperforms compared to the behavior policy.

03

Epsilon-aware policies generalize across deployment regimes.

Abstract

Reinforcement learning (RL) in healthcare has had mixed results, with reward sparsity, unreliable off-policy evaluation, and deployment-simulation gap as recurring failure modes. We argue that chronic disease management is structurally a more tractable RL setting than the acute-care problems the field has primarily studied, but only if the problem is formalized to exploit chronic care's properties. We propose such a formalization. The agent's objective is to compress time-to-control (TTC) under a tiered reward calibrated to the CMS ACCESS Model. Two quantities from our companion preference-learning paper [Singh et al. 2026] enter as load-bearing structural elements: the execution intensity \epsilon bounds action availability under a constrained Markov Decision Process, and the clinician capability \kappa weights offline-data transitions during RL training. Together they couple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.