Challenges and Opportunities in Causality Analysis Using Large Language Models

Wlodek W. Zadrozny

PMC · DOI:10.3390/e28010023·December 24, 2025

Challenges and Opportunities in Causality Analysis Using Large Language Models

Wlodek W. Zadrozny

PDF

Open Access

TL;DR

This paper explores how large language models can extract causal relationships from text, highlighting their potential despite limitations like hallucinations.

Contribution

The paper introduces new insights into how LLMs can perform causality analysis using diverse perspectives like counterfactual and Aristotelian frameworks.

Findings

01

GPT and Gemini models achieve 88–91% agreement on causal relationships, surpassing typical inter-annotator agreement.

02

LLMs can explain and critique causal analyses, applying multiple causal perspectives effectively.

03

LLMs may help overcome annotated data sparsity and shift focus from annotations to semantic understanding of causality.

Abstract

This article examines the challenges and opportunities in extracting causal information from text with Large Language Models (LLMs). It first establishes the importance of causality extraction and then explores different views on causality, including common sense ideas informing different data annotation schemes, Aristotle’s Four Causes, and Pearl’s Ladder of Causation. The paper notes the relevance of this conceptual variety for the task. The text reviews datasets and work related to finding causal expressions, both using traditional machine learning methods and LLMs. Although the known limitations of LLMs—hallucinations and lack of common sense—affect the reliability of causal findings, GPT and Gemini models (GPT-5 and Gemini 2.5 Pro and others) show the ability to conduct causality analysis; moreover, they can even apply different perspectives, such as counterfactual and…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Diseases1

hallucinations

Figures9

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Topic Modeling · Explainable Artificial Intelligence (XAI)