From Bag of Sentences to Document: Distantly Supervised Relation Extraction via Machine Reading Comprehension
Lingyong Yan, Xianpei Han, Le Sun, Fangchao Liu, Ning Bian

TL;DR
This paper introduces a document-based distant supervision approach for relation extraction, reformulating it as a machine reading comprehension task to better utilize evidence and reduce noise, achieving state-of-the-art results.
Contribution
It proposes a novel document-based DS paradigm and a new loss function, DSLoss, to improve relation extraction by leveraging comprehensive evidence and handling noisy labels.
Findings
Achieves state-of-the-art DS relation extraction performance.
Effectively leverages inter-sentence and entity-level evidence.
Reduces noisy label impact through the DSLoss function.
Abstract
Distant supervision (DS) is a promising approach for relation extraction but often suffers from the noisy label problem. Traditional DS methods usually represent an entity pair as a bag of sentences and denoise labels using multi-instance learning techniques. The bag-based paradigm, however, fails to leverage the inter-sentence-level and the entity-level evidence for relation extraction, and their denoising algorithms are often specialized and complicated. In this paper, we propose a new DS paradigm--document-based distant supervision, which models relation extraction as a document-based machine reading comprehension (MRC) task. By re-organizing all sentences about an entity as a document and extracting relations via querying the document with relation-specific questions, the document-based DS paradigm can simultaneously encode and exploit all sentence-level, inter-sentence-level, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsLinear Layer · Residual Connection · Weight Decay · Attention Dropout · Linear Warmup With Linear Decay · WordPiece · Adam · Dropout · Softmax · Dense Connections
