Large Language Models as Co-Pilots for Causal Inference in Medical Studies
Ahmed Alaa, Rachael V. Phillips, Emre K{\i}c{\i}man, Laura B. Balzer,, Mark van der Laan, Maya Petersen

TL;DR
This paper investigates how large language models can serve as co-pilots to assist medical researchers in identifying and addressing biases and flaws in causal inference from observational clinical data.
Contribution
It introduces a conceptual framework for using LLMs as causal co-pilots, integrating domain knowledge and natural language interaction to improve study design validity.
Findings
Proposes a structured framework for LLMs in causal inference
Provides illustrative examples of LLMs assisting in study design
Highlights challenges and opportunities in deploying LLMs for epidemiology
Abstract
The validity of medical studies based on real-world clinical data, such as observational studies, depends on critical assumptions necessary for drawing causal conclusions about medical interventions. Many published studies are flawed because they violate these assumptions and entail biases such as residual confounding, selection bias, and misalignment between treatment and measurement times. Although researchers are aware of these pitfalls, they continue to occur because anticipating and addressing them in the context of a specific study can be challenging without a large, often unwieldy, interdisciplinary team with extensive expertise. To address this expertise gap, we explore the use of large language models (LLMs) as co-pilot tools to assist researchers in identifying study design flaws that undermine the validity of causal inferences. We propose a conceptual framework for LLMs as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Computational and Text Analysis Methods
MethodsAttentive Walk-Aggregating Graph Neural Network · Causal inference
