Retrieving Classes of Causal Orders with Inconsistent Knowledge Bases
Federico Baldo, Simon Ferreira, Charles K. Assaad

TL;DR
This paper introduces a novel LLM-based method for deriving causal order abstractions from inconsistent knowledge bases, improving causal discovery reliability in scientific literature analysis.
Contribution
The paper presents a new approach that uses consistency scores from LLMs to construct causal order abstractions and identify causal DAGs, addressing hallucination issues.
Findings
Effectively recovers correct causal order in epidemiology literature
Constructs semi-complete partially directed graphs from pairwise scores
Maximizes consistency to identify causal structures
Abstract
Traditional causal discovery methods often depend on strong, untestable assumptions, making them unreliable in real-world applications. In this context, Large Language Models (LLMs) have emerged as a promising alternative for extracting causal knowledge from text-based metadata, effectively consolidating domain expertise. However, LLMs are prone to hallucinations, necessitating strategies that account for these limitations. One effective approach is to use a consistency measure as a proxy of reliability. Moreover, LLMs do not clearly distinguish direct from indirect causal relationships, complicating the discovery of causal Directed Acyclic Graphs (DAGs), which are often sparse. This ambiguity is evident in the way informal sentences are formulated in various domains. For this reason, focusing on causal orders provides a more practical and direct task for LLMs. We propose a new method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
