Cross-document Event Identity via Dense Annotation
Adithya Pratapa, Zhengzhong Liu, Kimihiro Hasegawa, Linwei Li, Yukari, Yamakawa, Shikun Zhang, Teruko Mitamura

TL;DR
This paper introduces a dense annotation method and dataset for cross-document event coreference, addressing limitations of prior work by capturing richer event mentions and contextual information to better understand event identity.
Contribution
It presents a new dense annotation workflow, a comprehensive dataset, and an open-source toolkit for improved cross-document event coreference research.
Findings
Rich annotations include event links, time, location, and participants.
The dataset enables better understanding of event identity across documents.
Open-source toolkit facilitates further research.
Abstract
In this paper, we study the identity of textual events from different documents. While the complex nature of event identity is previously studied (Hovy et al., 2013), the case of events across documents is unclear. Prior work on cross-document event coreference has two main drawbacks. First, they restrict the annotations to a limited set of event types. Second, they insufficiently tackle the concept of event identity. Such annotation setup reduces the pool of event mentions and prevents one from considering the possibility of quasi-identity relations. We propose a dense annotation approach for cross-document event coreference, comprising a rich source of event mentions and a dense annotation effort between related document pairs. To this end, we design a new annotation workflow with careful quality control and an easy-to-use annotation interface. In addition to the links, we further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
