Causal schema induction for knowledge discovery
Michael Regan, Jena D. Hwang, Keisuke Sakaguchi, James, Pustejovsky

TL;DR
This paper introduces Torquestra, a curated dataset combining temporal, event, and causal structures to improve causal schema induction for knowledge discovery in news texts, demonstrating the effectiveness of causal-aware models.
Contribution
The paper presents a new dataset, Torquestra, and benchmarks models for causal schema induction, addressing data scarcity and enhancing causal reasoning in text analysis.
Findings
Causal structure-aware models outperform lexical-based methods.
Torquestra dataset effectively supports knowledge discovery tasks.
Models successfully identify texts with similar causal components.
Abstract
Making sense of familiar yet new situations typically involves making generalizations about causal schemas, stories that help humans reason about event sequences. Reasoning about events includes identifying cause and effect relations shared across event instances, a process we refer to as causal schema induction. Statistical schema induction systems may leverage structural knowledge encoded in discourse or the causal graphs associated with event meaning, however resources to study such causal structure are few in number and limited in size. In this work, we investigate how to apply schema induction models to the task of knowledge discovery for enhanced search of English-language news texts. To tackle the problem of data scarcity, we present Torquestra, a manually curated dataset of text-graph-schema units integrating temporal, event, and causal structures. We benchmark our dataset on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Semantic Web and Ontologies
