Formally Verifying and Explaining Sepsis Treatment Policies with COOL-MC
Dennis Gross

TL;DR
COOL-MC is a novel framework that combines formal verification and explainability to analyze and debug reinforcement learning policies for sepsis treatment, making them safer and more interpretable in critical healthcare settings.
Contribution
It introduces a method to construct reachable state spaces, label states with clinical propositions, and integrate explainability with probabilistic model checking for healthcare policies.
Findings
Verified safety bounds for sepsis treatment policies.
Trained a policy with optimal survival probability.
Revealed reliance on dosing history over patient condition.
Abstract
Safe and interpretable sequential decision-making is critical in healthcare, yet reinforcement learning (RL) policies for sepsis treatment optimization remain opaque and difficult to verify. Standard probabilistic model checkers operate on the full state space, which becomes infeasible for larger MDPs, and cannot explain why a learned policy makes particular decisions. COOL-MC wraps the model checker Storm but adds three key capabilities: it constructs only the reachable state space induced by a trained policy, yielding a smaller discrete-time Markov chain amenable to verification even when full-MDP analysis is intractable; it automatically labels states with clinically meaningful atomic propositions; and it integrates explainability methods with probabilistic computation tree logic (PCTL) queries to reveal which features drive decisions across treatment trajectories. We demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSepsis Diagnosis and Treatment · Machine Learning in Healthcare · Explainable Artificial Intelligence (XAI)
