Eliciting associations between clinical variables from LLMs via comparison questions across populations

Fabian Kabus; Kian Kordtomeikel; Thomas Brox; Heinz Wiendl; Daiana Stolz; and Harald Binder

arXiv:2605.06335·cs.LG·May 8, 2026

Eliciting associations between clinical variables from LLMs via comparison questions across populations

Fabian Kabus, Kian Kordtomeikel, Thomas Brox, Heinz Wiendl, Daiana Stolz, and Harald Binder

PDF

TL;DR

This paper presents a method to extract meaningful correlation and causal information from large language models in biomedical contexts using structured comparison questions and statistical modeling, enabling invariant causal predictions.

Contribution

It introduces a novel approach combining triplet comparison questions with statistical models to infer correlations and causal links from LLMs without internal access.

Findings

01

Correlations elicited are smooth, stable, and clinically interpretable.

02

Method supports invariant causal prediction with a small set of candidate links.

03

Approach demonstrated in COPD and MS clinical domains.

Abstract

The training data of large language models (LLMs) comprises a wide range of biomedical literature, reflecting data from many different patient populations. We investigate how it might be possible to recover information on correlation and causal links between patient characteristics, as a key building block for medical decision making. To avoid the pitfalls of direct elicitation, we propose an approach based on structured comparison questions, specifically patient comparison triplet questions. This is combined with a statistical model for the LLM representation that provides estimates of correlations without access to activations or model internals. Intuitively, we consider how similarity decisions of LLMs based on a first variable are affected by providing information on a second variable for one of the patients being assessed. We then induce prompt-level environment shifts to obtain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.