Uncover the Ground-Truth Relations in Distant Supervision: A Neural   Expectation-Maximization Framework

Junfan Chen; Richong Zhang; Yongyi Mao; Hongyu Guo; Jie Xu

arXiv:1909.05448·cs.CL·September 13, 2019

Uncover the Ground-Truth Relations in Distant Supervision: A Neural Expectation-Maximization Framework

Junfan Chen, Richong Zhang, Yongyi Mao, Hongyu Guo, Jie Xu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neural EM framework for relation extraction that effectively handles noisy labels in distant supervision, significantly improving the accuracy of uncovering true relation labels from large text corpora.

Contribution

It presents a novel label-denoising approach combining neural networks with probabilistic modeling to address noisy labels in distant supervision for relation extraction.

Findings

01

Significant performance improvement over existing methods.

02

Effective in uncovering ground-truth relation labels.

03

Robust to noisy label conditions.

Abstract

Distant supervision for relation extraction enables one to effectively acquire structured relations out of very large text corpora with less human efforts. Nevertheless, most of the prior-art models for such tasks assume that the given text can be noisy, but their corresponding labels are clean. Such unrealistic assumption is contradictory with the fact that the given labels are often noisy as well, thus leading to significant performance degradation of those models on real-world data. To cope with this challenge, we propose a novel label-denoising framework that combines neural network with probabilistic modelling, which naturally takes into account the noisy labels during learning. We empirically demonstrate that our approach significantly improves the current art in uncovering the ground-truth relation labels.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AlbertChen1991/nEM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques