Optimizing Medical Treatment for Sepsis in Intensive Care: from Reinforcement Learning to Pre-Trial Evaluation
Luchen Li, Ignacio Albert-Smet, and Aldo A. Faisal

TL;DR
This paper develops a reinforcement learning framework for optimizing sepsis treatment in intensive care, enabling retrospective policy evaluation and a pathway to prospective clinical testing with human expert comparison.
Contribution
It introduces a novel pre-trial evaluation framework that combines RL with human expert assessment to ensure clinical safety and effectiveness before deployment.
Findings
RL-based policies outperform traditional approaches in retrospective data.
The proposed evaluation method accurately estimates clinician versus system decision quality.
The framework supports regulatory compliance for prospective clinical testing.
Abstract
Our aim is to establish a framework where reinforcement learning (RL) of optimizing interventions retrospectively allows us a regulatory compliant pathway to prospective clinical testing of the learned policies in a clinical deployment. We focus on infections in intensive care units which are one of the major causes of death and difficult to treat because of the complex and opaque patient dynamics, and the clinically debated, highly-divergent set of intervention policies required by each individual patient, yet intensive care units are naturally data rich. In our work, we build on RL approaches in healthcare ("AI Clinicians"), and learn off-policy continuous dosing policy of pharmaceuticals for sepsis treatment using historical intensive care data under partially observable MDPs (POMDPs). POMPDs capture uncertainty in patient state better by taking in all historical information,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSepsis Diagnosis and Treatment · Machine Learning in Healthcare · Cardiac electrophysiology and arrhythmias
