Eliciting associations between clinical variables from LLMs via comparison questions across populations
Fabian Kabus, Kian Kordtomeikel, Thomas Brox, Heinz Wiendl, Daiana Stolz, and Harald Binder

TL;DR
This paper presents a method to extract meaningful correlation and causal information from large language models in biomedical contexts using structured comparison questions and statistical modeling, enabling invariant causal predictions.
Contribution
It introduces a novel approach combining triplet comparison questions with statistical models to infer correlations and causal links from LLMs without internal access.
Findings
Correlations elicited are smooth, stable, and clinically interpretable.
Method supports invariant causal prediction with a small set of candidate links.
Approach demonstrated in COPD and MS clinical domains.
Abstract
The training data of large language models (LLMs) comprises a wide range of biomedical literature, reflecting data from many different patient populations. We investigate how it might be possible to recover information on correlation and causal links between patient characteristics, as a key building block for medical decision making. To avoid the pitfalls of direct elicitation, we propose an approach based on structured comparison questions, specifically patient comparison triplet questions. This is combined with a statistical model for the LLM representation that provides estimates of correlations without access to activations or model internals. Intuitively, we consider how similarity decisions of LLMs based on a first variable are affected by providing information on a second variable for one of the patients being assessed. We then induce prompt-level environment shifts to obtain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
