Reinforcement Learning for Target Zone Blood Glucose Control
David H. Mguni, Jing Dong, Wanrong Yang, Ziquan Liu, Muhammad Salman Haleem, Baoxiang Wang

TL;DR
This paper introduces a novel reinforcement learning framework for blood glucose control in Type 1 Diabetes, combining impulse and switching control modalities to improve safety and effectiveness in treatment management.
Contribution
The work presents a unified RL approach that models complex temporal dynamics and incorporates physiological factors, advancing safe and realistic decision-making in diabetes care.
Findings
Reduced blood glucose violations from 22.4% to 10.8%.
Provides theoretical guarantees of convergence.
Demonstrates empirical improvements in a stylised T1DM control task.
Abstract
Managing physiological variables within clinically safe target zones is a central challenge in healthcare, particularly for chronic conditions such as Type 1 Diabetes Mellitus (T1DM). Reinforcement learning (RL) offers promise for personalising treatment, but struggles with the delayed and heterogeneous effects of interventions. We propose a novel RL framework to study and support decision-making in T1DM technologies, such as automated insulin delivery. Our approach captures the complex temporal dynamics of treatment by unifying two control modalities: \textit{impulse control} for discrete, fast-acting interventions (e.g., insulin boluses), and \textit{switching control} for longer-acting treatments and regime shifts. The core of our method is a constrained Markov decision process augmented with physiological state features, enabling safe policy learning under clinical and resource…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
