FREDA: Flexible Relation Extraction Data Annotation

Michael Strobl; Amine Trabelsi; Osmar Zaiane

arXiv:2204.07150·cs.CL·December 15, 2022

FREDA: Flexible Relation Extraction Data Annotation

Michael Strobl, Amine Trabelsi, Osmar Zaiane

PDF

Open Access 1 Repo

TL;DR

FREDA introduces a flexible annotation method that rapidly produces high-quality relation extraction datasets, enabling effective training of neural models with good generalization across datasets.

Contribution

The paper presents a novel annotation approach that accelerates dataset creation for relation extraction without sacrificing quality.

Findings

01

Annotated 10,022 sentences for 19 relations efficiently

02

Neural models trained on FREDA datasets perform well and generalize

03

High-quality datasets can be produced quickly for relation extraction

Abstract

To effectively train accurate Relation Extraction models, sufficient and properly labeled data is required. Adequately labeled data is difficult to obtain and annotating such data is a tricky undertaking. Previous works have shown that either accuracy has to be sacrificed or the task is extremely time-consuming, if done accurately. We are proposing an approach in order to produce high-quality datasets for the task of Relation Extraction quickly. Neural models, trained to do Relation Extraction on the created datasets, achieve very good results and generalize well to other datasets. In our study, we were able to annotate 10,022 sentences for 19 relations in a reasonable amount of time, and trained a commonly used baseline model for each relation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mjstrobl/freda
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · linguistics and terminology studies