Indirect Supervision for Relation Extraction using Question-Answer Pairs
Zeqiu Wu, Xiang Ren, Frank F. Xu, Ji Li, Jiawei Han

TL;DR
This paper introduces ReQuest, a framework that uses question-answer pairs as an indirect supervision method to improve relation extraction accuracy by reducing noise from distant supervision, achieving significant F1 score improvements.
Contribution
ReQuest is a novel approach that jointly embeds relation mentions and QA pairs in shared spaces to leverage QA data for noise reduction in relation extraction.
Findings
11% average F1 score improvement on two datasets
Effective noise reduction from distant supervision
Joint embedding of relation and QA data enhances extraction accuracy
Abstract
Automatic relation extraction (RE) for types of interest is of great importance for interpreting massive text corpora in an efficient manner. Traditional RE models have heavily relied on human-annotated corpus for training, which can be costly in generating labeled data and become obstacles when dealing with more relation types. Thus, more RE extraction systems have shifted to be built upon training data automatically acquired by linking to knowledge bases (distant supervision). However, due to the incompleteness of knowledge bases and the context-agnostic labeling, the training data collected via distant supervision (DS) can be very noisy. In recent years, as increasing attention has been brought to tackling question-answering (QA) tasks, user feedback or datasets of such tasks become more accessible. In this paper, we propose a novel framework, ReQuest, to leverage question-answer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
