Context-Aware Reasoning On Parametric Knowledge for Inferring Causal Variables
Ivaxi Sheth, Sahar Abdelnabi, Mario Fritz

TL;DR
This paper presents a new benchmark for causal graph completion, demonstrating that large language models can reason about backdoor variables in causal inference tasks beyond memorization.
Contribution
It introduces a novel, challenging benchmark for causal reasoning that tests LLMs' ability to infer hidden variables in context-dependent scenarios.
Findings
LLMs can hypothesize backdoor variables effectively.
The benchmark includes over 4000 queries with varying difficulty.
Reasoning requires context-aware inference beyond memorization.
Abstract
Scientific discovery catalyzes human intellectual advances, driven by the cycle of hypothesis generation, experimental design, evaluation, and assumption refinement. Central to this process is causal inference, uncovering the mechanisms behind observed phenomena. While randomized experiments provide strong inferences, they are often infeasible due to ethical or practical constraints. However, observational studies are prone to confounding or mediating biases. While crucial, identifying such backdoor paths is expensive and heavily depends on scientists' domain knowledge to generate hypotheses. We introduce a novel benchmark where the objective is to complete a partial causal graph. We design a benchmark with varying difficulty levels with over 4000 queries. We show the strong ability of LLMs to hypothesize the backdoor variables between a cause and its effect. Unlike simple knowledge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Bayesian Modeling and Causal Inference · Data Quality and Management
MethodsAttention Is All You Need · Byte Pair Encoding · Absolute Position Encodings · Softmax · Label Smoothing · Dropout · Layer Normalization · Position-Wise Feed-Forward Layer · Linear Layer · Adam
