MEE: A Novel Multilingual Event Extraction Dataset

Amir Pouran Ben Veyseh; Javid Ebrahimi; Franck Dernoncourt; and Thien; Huu Nguyen

arXiv:2211.05955·cs.CL·November 21, 2022

MEE: A Novel Multilingual Event Extraction Dataset

Amir Pouran Ben Veyseh, Javid Ebrahimi, Franck Dernoncourt, and Thien, Huu Nguyen

PDF

Open Access

TL;DR

This paper introduces MEE, a comprehensive multilingual event extraction dataset covering over 50,000 event mentions across 8 languages, aiming to advance non-English event extraction research.

Contribution

The paper presents a new high-quality multilingual EE dataset with extensive annotations, addressing the lack of resources for non-English languages in event extraction.

Findings

01

Identifies challenges in multilingual EE

02

Provides baseline results for the dataset

03

Highlights opportunities for future research

Abstract

Event Extraction (EE) is one of the fundamental tasks in Information Extraction (IE) that aims to recognize event mentions and their arguments (i.e., participants) from text. Due to its importance, extensive methods and resources have been developed for Event Extraction. However, one limitation of current research for EE involves the under-exploration for non-English languages in which the lack of high-quality multilingual EE datasets for model training and evaluation has been the main hindrance. To address this limitation, we propose a novel Multilingual Event Extraction dataset (MEE) that provides annotation for more than 50K event mentions in 8 typologically different languages. MEE comprehensively annotates data for entity mentions, event triggers and event arguments. We conduct extensive experiments on the proposed dataset to reveal challenges and opportunities for multilingual EE.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques