MAVEN-Fact: A Large-scale Event Factuality Detection Dataset

Chunyang Li; Hao Peng; Xiaozhi Wang; Yunjia Qi; Lei Hou; Bin Xu,; Juanzi Li

arXiv:2407.15352·cs.CL·July 23, 2024

MAVEN-Fact: A Large-scale Event Factuality Detection Dataset

Chunyang Li, Hao Peng, Xiaozhi Wang, Yunjia Qi, Lei Hou, Bin Xu,, Juanzi Li

PDF

Open Access 2 Repos 1 Video

TL;DR

MAVEN-Fact introduces the largest high-quality dataset for event factuality detection, enabling better understanding of event knowledge and highlighting the differing impacts of event arguments on models.

Contribution

It provides a large-scale, high-quality EFD dataset based on MAVEN, supporting new analyses and applications in event factuality detection.

Findings

01

MAVEN-Fact is challenging for models and LLMs.

02

Event arguments aid fine-tuned models but not LLMs.

03

Event factuality detection can reduce hallucinations in LLMs.

Abstract

Event Factuality Detection (EFD) task determines the factuality of textual events, i.e., classifying whether an event is a fact, possibility, or impossibility, which is essential for faithfully understanding and utilizing event knowledge. However, due to the lack of high-quality large-scale data, event factuality detection is under-explored in event understanding research, which limits the development of EFD community. To address these issues and provide faithful event understanding, we introduce MAVEN-Fact, a large-scale and high-quality EFD dataset based on the MAVEN dataset. MAVEN-Fact includes factuality annotations of 112,276 events, making it the largest EFD dataset. Extensive experiments demonstrate that MAVEN-Fact is challenging for both conventional fine-tuned models and large language models (LLMs). Thanks to the comprehensive annotations of event arguments and relations in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

MAVEN-FACT: A Large-scale Event Factuality Detection Dataset· underline

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Big Data Technologies and Applications · Digital and Cyber Forensics