Active Testing: An Unbiased Evaluation Method for Distantly Supervised   Relation Extraction

Pengshuai Li; Xinsong Zhang; Weijia Jia; Wei Zhao

arXiv:2010.08777·cs.CL·October 20, 2020

Active Testing: An Unbiased Evaluation Method for Distantly Supervised Relation Extraction

Pengshuai Li, Xinsong Zhang, Weijia Jia, Wei Zhao

PDF

Open Access

TL;DR

This paper introduces active testing, a new evaluation method that combines noisy test data with manual annotations to provide more accurate and unbiased performance assessments for distantly supervised relation extraction models.

Contribution

The paper proposes a novel active testing approach that mitigates bias in evaluation by integrating noisy datasets with limited manual annotations.

Findings

01

Active testing reduces bias in relation extraction evaluation.

02

The method achieves approximately unbiased performance estimates.

03

Experiments demonstrate improved evaluation accuracy on benchmark datasets.

Abstract

Distant supervision has been a widely used method for neural relation extraction for its convenience of automatically labeling datasets. However, existing works on distantly supervised relation extraction suffer from the low quality of test set, which leads to considerable biased performance evaluation. These biases not only result in unfair evaluations but also mislead the optimization of neural relation extraction. To mitigate this problem, we propose a novel evaluation method named active testing through utilizing both the noisy test set and a few manual annotations. Experiments on a widely used benchmark show that our proposed approach can yield approximately unbiased evaluations for distantly supervised relation extractors.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Algorithms