Causal thinking for decision making on Electronic Health Records: why and how
Matthieu Doutreligne (SODA), Tristan Struja (MIT, USZ), Judith, Abecassis (SODA), Claire Morgand (ARS IDF), Leo Anthony Celi (MIT), Ga\"el, Varoquaux (SODA)

TL;DR
This paper emphasizes the importance of causal thinking over simple prediction in healthcare decision-making, providing a framework for analyzing electronic health records to draw valid causal inferences.
Contribution
It introduces a step-by-step framework for causal analysis of EHR data, highlighting pitfalls and demonstrating the process with a case study on albumin and sepsis mortality.
Findings
Framework helps build valid causal decisions from EHR data
Analysis of albumin's effect on sepsis mortality in MIMIC-IV
Guidance on pitfalls and choices in causal inference from real-world data
Abstract
Accurate predictions, as with machine learning, may not suffice to provide optimal healthcare for every patient. Indeed, prediction can be driven by shortcuts in the data, such as racial biases. Causal thinking is needed for data-driven decisions. Here, we give an introduction to the key elements, focusing on routinely-collected data, electronic health records (EHRs) and claims data. Using such data to assess the value of an intervention requires care: temporal dependencies and existing practices easily confound the causal effect. We present a step-by-step framework to help build valid decision making from real-life patient records by emulating a randomized trial before individualizing decisions, eg with machine learning. Our framework highlights the most important pitfalls and considerations in analysing EHRs or claims data to draw causal conclusions. We illustrate the various choices…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Sepsis Diagnosis and Treatment · Advanced Causal Inference Techniques
