Context-Aware Reasoning On Parametric Knowledge for Inferring Causal Variables

Ivaxi Sheth; Sahar Abdelnabi; Mario Fritz

arXiv:2409.02604·cs.LG·September 26, 2025

Context-Aware Reasoning On Parametric Knowledge for Inferring Causal Variables

Ivaxi Sheth, Sahar Abdelnabi, Mario Fritz

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper presents a new benchmark for causal graph completion, demonstrating that large language models can reason about backdoor variables in causal inference tasks beyond memorization.

Contribution

It introduces a novel, challenging benchmark for causal reasoning that tests LLMs' ability to infer hidden variables in context-dependent scenarios.

Findings

01

LLMs can hypothesize backdoor variables effectively.

02

The benchmark includes over 4000 queries with varying difficulty.

03

Reasoning requires context-aware inference beyond memorization.

Abstract

Scientific discovery catalyzes human intellectual advances, driven by the cycle of hypothesis generation, experimental design, evaluation, and assumption refinement. Central to this process is causal inference, uncovering the mechanisms behind observed phenomena. While randomized experiments provide strong inferences, they are often infeasible due to ethical or practical constraints. However, observational studies are prone to confounding or mediating biases. While crucial, identifying such backdoor paths is expensive and heavily depends on scientists' domain knowledge to generate hypotheses. We introduce a novel benchmark where the objective is to complete a partial causal graph. We design a benchmark with varying difficulty levels with over 4000 queries. We show the strong ability of LLMs to hypothesize the backdoor variables between a cause and its effect. Unlike simple knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ivaxi0s/hypothesizing-causal-variable-llm
noneOfficial

Videos

Context-Aware Reasoning On Parametric Knowledge for Inferring Causal Variables· underline

Taxonomy

TopicsNatural Language Processing Techniques · Bayesian Modeling and Causal Inference · Data Quality and Management

MethodsAttention Is All You Need · Byte Pair Encoding · Absolute Position Encodings · Softmax · Label Smoothing · Dropout · Layer Normalization · Position-Wise Feed-Forward Layer · Linear Layer · Adam