Evaluating Reinforcement Learning Algorithms in Observational Health   Settings

Omer Gottesman; Fredrik Johansson; Joshua Meier; Jack Dent; Donghun; Lee; Srivatsan Srinivasan; Linying Zhang; Yi Ding; David Wihl; Xuefeng Peng,; Jiayu Yao; Isaac Lage; Christopher Mosch; Li-wei H. Lehman; Matthieu; Komorowski; Matthieu Komorowski; Aldo Faisal; Leo Anthony Celi; David Sontag,; Finale Doshi-Velez

arXiv:1805.12298·cs.LG·June 1, 2018·86 cites

Evaluating Reinforcement Learning Algorithms in Observational Health Settings

Omer Gottesman, Fredrik Johansson, Joshua Meier, Jack Dent, Donghun, Lee, Srivatsan Srinivasan, Linying Zhang, Yi Ding, David Wihl, Xuefeng Peng,, Jiayu Yao, Isaac Lage, Christopher Mosch, Li-wei H. Lehman, Matthieu, Komorowski, Matthieu Komorowski, Aldo Faisal, Leo Anthony Celi

PDF

Open Access

TL;DR

This paper discusses the challenges and considerations in evaluating reinforcement learning algorithms for healthcare decision-making, emphasizing the importance of careful assessment to ensure reliable and effective treatment policies.

Contribution

It highlights key subtleties in evaluating RL in healthcare, offering guidance to avoid misleading conclusions and improve policy assessment methods.

Findings

01

Evaluation metrics can be biased by how patient history is summarized.

02

Statistical estimator variance affects the reliability of policy quality estimates.

03

Confounders in observational data can lead to misleading policy evaluations.

Abstract

Much attention has been devoted recently to the development of machine learning algorithms with the goal of improving treatment policies in healthcare. Reinforcement learning (RL) is a sub-field within machine learning that is concerned with learning how to make sequences of decisions so as to optimize long-term effects. Already, RL algorithms have been proposed to identify decision-making strategies for mechanical ventilation, sepsis management and treatment of schizophrenia. However, before implementing treatment policies learned by black-box algorithms in high-stakes clinical decision problems, special care must be taken in the evaluation of these policies. In this document, our goal is to expose some of the subtleties associated with evaluating RL algorithms in healthcare. We aim to provide a conceptual starting point for clinical and computational researchers to ask the right…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Explainable Artificial Intelligence (XAI) · Health Systems, Economic Evaluations, Quality of Life