Cascading Large Language Models for Salient Event Graph Generation

Xingwei Tan; Yuxiang Zhou; Gabriele Pergola; Yulan He

arXiv:2406.18449·cs.CL·February 11, 2025·2 cites

Cascading Large Language Models for Salient Event Graph Generation

Xingwei Tan, Yuxiang Zhou, Gabriele Pergola, Yulan He

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces CALLMSAE, a framework using cascading large language models to generate salient event graphs from long documents, reducing annotation costs and improving graph accuracy.

Contribution

It proposes a novel LLM-based cascading approach for salient event graph generation and creates NYT-SEG, a large-scale automatically annotated dataset for training and evaluation.

Findings

01

Outperforms models trained on existing datasets.

02

Generates more accurate and salient event graphs.

03

Reduces reliance on costly human annotations.

Abstract

Generating event graphs from long documents is challenging due to the inherent complexity of multiple tasks involved such as detecting events, identifying their relationships, and reconciling unstructured input with structured graphs. Recent studies typically consider all events with equal importance, failing to distinguish salient events crucial for understanding narratives. This paper presents CALLMSAE, a CAscading Large Language Model framework for SAlient Event graph generation, which leverages the capabilities of LLMs and eliminates the need for costly human annotations. We first identify salient events by prompting LLMs to generate summaries, from which salient events are identified. Next, we develop an iterative code refinement prompting strategy to generate event relation graphs, removing hallucinated relations and recovering missing edges. Powered by CALLMSAE, we present…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xingwei-warwick/callmsae
noneOfficial

Videos

Cascading Large Language Models for Salient Event Graph Generation· underline

Taxonomy

TopicsSemantic Web and Ontologies · Data Quality and Management · Topic Modeling

MethodsSparse Evolutionary Training