Guardian-regularized Safe Offline Reinforcement Learning for Smart Weaning of Mechanical Circulatory Devices

Aysin Tumay; Sophia Sun; Sonia Fereidooni; Aaron Dumas; Elise Jortberg; Rose Yu

arXiv:2511.06111 · cs.LG · November 11, 2025

TL;DR

This paper introduces an offline reinforcement learning framework for safely automating the weaning of mechanical circulatory support devices in cardiogenic shock patients, combining a regularized policy optimization algorithm with a probabilistic digital twin.

Contribution

The paper proposes CORMPO, a clinically-aware, density-regularized offline RL algorithm, and a Transformer-based digital twin for modeling circulatory dynamics, advancing safe decision-making in medical settings.

Findings

01. CORMPO outperforms baseline offline RL methods by 28% in reward.

02. The approach improves clinical metric scores by 82.6%.

03. Theoretical guarantees are established for CORMPO.

Abstract

We study the sequential decision-making problem for automated weaning of mechanical circulatory support (MCS) devices in cardiogenic shock patients. MCS devices are percutaneous micro-axial flow pumps that provide left ventricular unloading and forward blood flow, but current weaning strategies vary significantly across care teams and lack data-driven approaches. Offline reinforcement learning (RL) has proven successful in sequential decision-making tasks, but our setting presents challenges for training and evaluating traditional offline RL methods: prohibition of online patient interaction, highly uncertain circulatory dynamics due to concurrent treatments, and limited data availability. We developed an end-to-end machine learning framework with two key contributions: (1) Clinically-aware OOD-regularized Model-based Policy Optimization (CORMPO), a density-regularized offline RL…

Taxonomy

Topics: Mechanical Circulatory Support Devices · Cardiac electrophysiology and arrhythmias · Reinforcement Learning in Robotics