Large Language Models for Causal Relations Extraction in Social Media: A Validation Framework for Disaster Intelligence

Ujun Jeong; Saketh Vishnubhatla; Bohan Jiang; Andre Harrison; Adrienne Raglin; Huan Liu

arXiv:2605.11348·cs.CL·May 13, 2026

Large Language Models for Causal Relations Extraction in Social Media: A Validation Framework for Disaster Intelligence

Ujun Jeong, Saketh Vishnubhatla, Bohan Jiang, Andre Harrison, Adrienne Raglin, Huan Liu

PDF

TL;DR

This paper evaluates the effectiveness of Large Language Models in extracting causal relations from disaster-related social media posts, proposing an evaluation framework and analyzing potential and risks.

Contribution

It introduces an expert-grounded evaluation framework for LLMs in causal relation extraction and assesses their reliability in disaster contexts.

Findings

01

LLMs can generate causal graphs that partially align with reference data.

02

There are significant risks of LLMs reflecting prior knowledge rather than actual post-event evidence.

03

The framework helps identify when LLMs provide trustworthy causal information.

Abstract

During disasters, extracting causal relations from social media can strengthen situational awareness by identifying factors linked to casualties, physical damage, infrastructure disruption, and cascading impacts. However, disaster-related posts are often informal, fragmented, and context-dependent, and they may describe personal experiences rather than explicit causal relations. In this work, we examine whether Large Language Models (LLMs) can effectively extract causal relations from disaster-related social media posts. To this end, we (1) propose an expert-grounded evaluation framework that compares LLM-generated causal graphs with reference graphs derived from disaster-specific reports and (2) assess whether the extracted relations are supported by post-event evidence or instead reflect model priors. Our findings highlight both the potential and risks of using LLMs for causal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.