Evaluating Reinforcement Learning Algorithms in Observational Health Settings
Omer Gottesman, Fredrik Johansson, Joshua Meier, Jack Dent, Donghun, Lee, Srivatsan Srinivasan, Linying Zhang, Yi Ding, David Wihl, Xuefeng Peng,, Jiayu Yao, Isaac Lage, Christopher Mosch, Li-wei H. Lehman, Matthieu, Komorowski, Matthieu Komorowski, Aldo Faisal, Leo Anthony Celi

TL;DR
This paper discusses the challenges and considerations in evaluating reinforcement learning algorithms for healthcare decision-making, emphasizing the importance of careful assessment to ensure reliable and effective treatment policies.
Contribution
It highlights key subtleties in evaluating RL in healthcare, offering guidance to avoid misleading conclusions and improve policy assessment methods.
Findings
Evaluation metrics can be biased by how patient history is summarized.
Statistical estimator variance affects the reliability of policy quality estimates.
Confounders in observational data can lead to misleading policy evaluations.
Abstract
Much attention has been devoted recently to the development of machine learning algorithms with the goal of improving treatment policies in healthcare. Reinforcement learning (RL) is a sub-field within machine learning that is concerned with learning how to make sequences of decisions so as to optimize long-term effects. Already, RL algorithms have been proposed to identify decision-making strategies for mechanical ventilation, sepsis management and treatment of schizophrenia. However, before implementing treatment policies learned by black-box algorithms in high-stakes clinical decision problems, special care must be taken in the evaluation of these policies. In this document, our goal is to expose some of the subtleties associated with evaluating RL algorithms in healthcare. We aim to provide a conceptual starting point for clinical and computational researchers to ask the right…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Explainable Artificial Intelligence (XAI) · Health Systems, Economic Evaluations, Quality of Life
