Advancing Safe Mechanical Ventilation Using Offline RL With Hybrid Actions and Clinically Aligned Rewards
Muhammad Hamza Yousuf, Jason Li, Sahar Vahdati, Raphael Theilen, Jakob Wittenstein, Jens Lehmann

TL;DR
This paper presents a novel offline reinforcement learning approach for optimizing mechanical ventilation settings, handling hybrid actions directly, and incorporating clinically aligned rewards to improve patient outcomes in ICU settings.
Contribution
It introduces a scalable offline RL method with factored action critics for hybrid actions and a multiobjective reward framework aligned with clinical goals.
Findings
Enhanced optimization of six ventilator settings
Improved safety and performance by avoiding discretization pitfalls
Clinically grounded reward function promotes equitable outcomes
Abstract
Invasive mechanical ventilation (MV) is a life-sustaining therapy commonly used in the intensive care unit (ICU) for patients with severe and acute conditions. These patients frequently rely on MV for breathing. Given the high risk of death in such cases, optimal MV settings can reduce mortality, minimize ventilator-induced lung injury, shorten ICU stays, and ease the strain on healthcare resources. However, optimizing MV settings remains a complex and error-prone process due to patient-specific variability. While Offline Reinforcement Learning (RL) shows promise for optimizing MV settings, current methods struggle with the hybrid (continuous and discrete) nature of MV settings. Discretizing continuous settings leads to exponential growth in the action space, which limits the number of optimizable settings. Converting the predictions back to continuous can cause a distribution shift,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsRespiratory Support and Mechanisms
