Pruning the Path to Optimal Care: Identifying Systematically Suboptimal   Medical Decision-Making with Inverse Reinforcement Learning

Inko Bovenzi; Adi Carmel; Michael Hu; Rebecca M. Hurwitz; Fiona; McBride; Leo Benac; Jos\'e Roberto Tello Ayala; Finale Doshi-Velez

arXiv:2411.05237·cs.LG·November 11, 2024

Pruning the Path to Optimal Care: Identifying Systematically Suboptimal Medical Decision-Making with Inverse Reinforcement Learning

Inko Bovenzi, Adi Carmel, Michael Hu, Rebecca M. Hurwitz, Fiona, McBride, Leo Benac, Jos\'e Roberto Tello Ayala, Finale Doshi-Velez

PDF

TL;DR

This paper introduces a novel IRL-based method to identify suboptimal medical decisions in ICU data by comparing clinician actions to peer consensus, revealing insights into clinical priorities and disparities.

Contribution

It presents a two-stage IRL approach with trajectory pruning to detect suboptimal decisions, advancing understanding of clinical decision-making from observational data.

Findings

01

Suboptimal actions vary by disease

02

Removing suboptimal actions impacts demographic groups differently

03

IRL can uncover clinical priorities from observational data

Abstract

In aims to uncover insights into medical decision-making embedded within observational data from clinical settings, we present a novel application of Inverse Reinforcement Learning (IRL) that identifies suboptimal clinician actions based on the actions of their peers. This approach centers two stages of IRL with an intermediate step to prune trajectories displaying behavior that deviates significantly from the consensus. This enables us to effectively identify clinical priorities and values from ICU data containing both optimal and suboptimal clinician decisions. We observe that the benefits of removing suboptimal actions vary by disease and differentially impact certain demographic groups.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.