Guardian-regularized Safe Offline Reinforcement Learning for Smart Weaning of Mechanical Circulatory Devices
Aysin Tumay, Sophia Sun, Sonia Fereidooni, Aaron Dumas, Elise Jortberg, Rose Yu

TL;DR
This paper introduces a novel offline reinforcement learning framework for safely automating the weaning process of mechanical circulatory support devices in heart failure patients, combining a regularized policy optimization algorithm with a probabilistic digital twin.
Contribution
The paper proposes CORMPO, a clinically-aware, density-regularized offline RL algorithm, and a Transformer-based digital twin for modeling circulatory dynamics, advancing safe decision-making in medical settings.
Findings
CORMPO outperforms baseline offline RL methods by 28% in reward.
The approach improves clinical metric scores by 82.6%.
Theoretical guarantees are established for CORMPO.
Abstract
We study the sequential decision-making problem for automated weaning of mechanical circulatory support (MCS) devices in cardiogenic shock patients. MCS devices are percutaneous micro-axial flow pumps that provide left ventricular unloading and forward blood flow, but current weaning strategies vary significantly across care teams and lack data-driven approaches. Offline reinforcement learning (RL) has proven to be successful in sequential decision-making tasks, but our setting presents challenges for training and evaluating traditional offline RL methods: prohibition of online patient interaction, highly uncertain circulatory dynamics due to concurrent treatments, and limited data availability. We developed an end-to-end machine learning framework with two key contributions (1) Clinically-aware OOD-regularized Model-based Policy Optimization (CORMPO), a density-regularized offline RL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMechanical Circulatory Support Devices · Cardiac electrophysiology and arrhythmias · Reinforcement Learning in Robotics
