Event Extraction for Portuguese: A QA-driven Approach using ACE-2005
Luís Filipe Cunha, Ricardo Campos, Alípio Jorge

TL;DR
This paper introduces a novel BERT-based framework for event extraction in Portuguese, utilizing translated ACE-2005 data and a QA-driven approach, achieving state-of-the-art results in trigger and argument classification.
Contribution
It presents the first Portuguese event extraction framework using a QA-driven approach with translated ACE-2005 data and fine-tuned BERT models.
Findings
Achieved F1 of 64.4 for trigger classification
Achieved F1 of 46.7 for argument classification
Set new state-of-the-art results in Portuguese event extraction
Abstract
Event extraction is an Information Retrieval task that commonly consists of identifying the central word for the event (trigger) and the event's arguments. This task has been extensively studied for English but lags behind for Portuguese, partly due to the lack of task-specific annotated corpora. This paper proposes a framework in which two separate BERT-based models were fine-tuned to identify and classify events in Portuguese documents. We decompose this task into two sub-tasks. First, we use a token classification model to detect event triggers. Then, to extract event arguments, we train a Question Answering model that queries the triggers about their corresponding event argument roles. Given the lack of event-annotated corpora in Portuguese, we translated the original version of the ACE-2005 dataset (a reference in the field) into Portuguese, producing a new corpus for Portuguese event…
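The two-stage decomposition described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the role inventory, the question template, the example sentence, and the function names `decode_triggers` and `build_questions` are all assumptions made for illustration, and the model calls themselves are left out. The sketch only shows how per-token trigger predictions could be turned into one question per argument role, which a QA model would then answer against the sentence.

```python
from typing import Dict, List, Tuple

# Hypothetical role inventory for one event type (illustrative only;
# the real schema would come from the ACE-2005 annotation guidelines).
EVENT_ROLES: Dict[str, List[str]] = {
    "Attack": ["Attacker", "Target", "Instrument", "Place", "Time"],
}


def decode_triggers(tokens: List[str], tags: List[str]) -> List[Tuple[int, str, str]]:
    """Stage 1 post-processing: keep every token whose predicted tag is not 'O'.

    `tags` stands in for the output of a token classification model,
    one event-type label (or 'O') per token.
    """
    return [(i, tok, tag) for i, (tok, tag) in enumerate(zip(tokens, tags)) if tag != "O"]


def build_questions(trigger: str, event_type: str) -> List[Tuple[str, str]]:
    """Stage 2 input: one (role, question) pair per argument role of the event type.

    The question template is an assumption, not the wording used in the paper.
    """
    roles = EVENT_ROLES.get(event_type, [])
    return [(role, f"What is the {role} in {trigger}?") for role in roles]


# Hypothetical example sentence with hand-written tags in place of model output.
tokens = ["Rebeldes", "atacaram", "a", "cidade", "ontem"]
tags = ["O", "Attack", "O", "O", "O"]

triggers = decode_triggers(tokens, tags)
questions = [q for _, trig, etype in triggers for q in build_questions(trig, etype)]
```

Each question in `questions` would be passed, together with the sentence as context, to an extractive QA model; a role with no answer span would be treated as unfilled.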
