Global Relation Embedding for Relation Extraction

Yu Su; Honglei Liu; Semih Yavuz; Izzeddin Gur; Huan Sun; Xifeng Yan

arXiv:1704.05958·cs.CL·April 20, 2018·1 cites

Global Relation Embedding for Relation Extraction

Yu Su, Honglei Liu, Semih Yavuz, Izzeddin Gur, Huan Sun, Xifeng Yan

PDF

Open Access 2 Repos

TL;DR

This paper introduces a global relation embedding method that leverages co-occurrence statistics to improve relation extraction accuracy under distant supervision, significantly reducing noise effects.

Contribution

It proposes a novel global statistics-based textual relation embedding approach that enhances existing models' performance in relation extraction tasks.

Findings

01

Improved relation extraction accuracy from 83.9% to 89.3%.

02

Embedding approach is more robust to training noise.

03

Significant performance gains on a popular dataset.

Abstract

We study the problem of textual relation embedding with distant supervision. To combat the wrong labeling problem of distant supervision, we propose to embed textual relations with global statistics of relations, i.e., the co-occurrence statistics of textual and knowledge base relations collected from the entire corpus. This approach turns out to be more robust to the training noise introduced by distant supervision. On a popular relation extraction dataset, we show that the learned textual relation embedding can be used to augment existing relation extraction models and significantly improve their performance. Most remarkably, for the top 1,000 relational facts discovered by the best existing model, the precision can be improved from 83.9% to 89.3%.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies