Reinforcement Learning for Target Zone Blood Glucose Control

David H. Mguni; Jing Dong; Wanrong Yang; Ziquan Liu; Muhammad Salman Haleem; Baoxiang Wang

arXiv:2508.03875·cs.LG·August 7, 2025

Reinforcement Learning for Target Zone Blood Glucose Control

David H. Mguni, Jing Dong, Wanrong Yang, Ziquan Liu, Muhammad Salman Haleem, Baoxiang Wang

PDF

TL;DR

This paper introduces a novel reinforcement learning framework for blood glucose control in Type 1 Diabetes, combining impulse and switching control modalities to improve safety and effectiveness in treatment management.

Contribution

The work presents a unified RL approach that models complex temporal dynamics and incorporates physiological factors, advancing safe and realistic decision-making in diabetes care.

Findings

01

Reduced blood glucose violations from 22.4% to 10.8%.

02

Provides theoretical guarantees of convergence.

03

Demonstrates empirical improvements in a stylised T1DM control task.

Abstract

Managing physiological variables within clinically safe target zones is a central challenge in healthcare, particularly for chronic conditions such as Type 1 Diabetes Mellitus (T1DM). Reinforcement learning (RL) offers promise for personalising treatment, but struggles with the delayed and heterogeneous effects of interventions. We propose a novel RL framework to study and support decision-making in T1DM technologies, such as automated insulin delivery. Our approach captures the complex temporal dynamics of treatment by unifying two control modalities: \textit{impulse control} for discrete, fast-acting interventions (e.g., insulin boluses), and \textit{switching control} for longer-acting treatments and regime shifts. The core of our method is a constrained Markov decision process augmented with physiological state features, enabling safe policy learning under clinical and resource…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.