Improving Distantly Supervised Relation Extraction with Self-Ensemble   Noise Filtering

Tapas Nayak; Navonil Majumder; Soujanya Poria

arXiv:2108.09689·cs.CL·August 24, 2021

Improving Distantly Supervised Relation Extraction with Self-Ensemble Noise Filtering

Tapas Nayak, Navonil Majumder, Soujanya Poria

PDF

Open Access 1 Repo

TL;DR

This paper introduces a self-ensemble filtering method to reduce noise in distantly supervised relation extraction, improving model robustness and F1 scores on the NYT dataset.

Contribution

It proposes a novel self-ensemble filtering mechanism to effectively remove noisy samples during training in distantly supervised relation extraction.

Findings

01

Improves F1 scores of neural relation extraction models.

02

Enhances robustness of models against noisy training data.

03

Validated on NYT dataset with multiple state-of-the-art models.

Abstract

Distantly supervised models are very popular for relation extraction since we can obtain a large amount of training data using the distant supervision method without human annotation. In distant supervision, a sentence is considered as a source of a tuple if the sentence contains both entities of the tuple. However, this condition is too permissive and does not guarantee the presence of relevant relation-specific information in the sentence. As such, distantly supervised training data contains much noise which adversely affects the performance of the models. In this paper, we propose a self-ensemble filtering mechanism to filter out the noisy samples during the training process. We evaluate our proposed framework on the New York Times dataset which is obtained via distant supervision. Our experiments with multiple state-of-the-art neural relation extraction models show that our proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nayakt/SENF4DSRE
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies