EventNarrative: A large-scale Event-centric Dataset for Knowledge Graph-to-Text Generation
Anthony Colas, Ali Sadeghian, Yue Wang, Daisy Zhe Wang

TL;DR
EventNarrative is a large-scale, high-quality dataset linking event-centric knowledge graphs with natural language text, designed to advance research in graph-to-text generation and evaluation.
Contribution
The paper introduces a novel, large-scale event-centric dataset with rich ontology and high data quality, filling a gap in graph-to-text research and enabling better model evaluation.
Findings
EventNarrative contains approximately 230,000 graph-text pairs.
The dataset is six times larger than existing parallel datasets.
Baseline evaluations demonstrate the dataset's utility for model development.
Abstract
We introduce EventNarrative, a knowledge graph-to-text dataset from publicly available open-world knowledge graphs. Given the recent advances in event-driven Information Extraction (IE), and that prior research on graph-to-text only focused on entity-driven KGs, this paper focuses on event-centric data. However, our data generation system can still be adapted to other other types of KG data. Existing large-scale datasets in the graph-to-text area are non-parallel, meaning there is a large disconnect between the KGs and text. The datasets that have a paired KG and text, are small scale and manually generated or generated without a rich ontology, making the corresponding graphs sparse. Furthermore, these datasets contain many unlinked entities between their KG and text pairs. EventNarrative consists of approximately 230,000 graphs and their corresponding natural language text, 6 times…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Semantic Web and Ontologies · Topic Modeling
