Distant Supervision for Relation Extraction with Linear Attenuation Simulation and Non-IID Relevance Embedding
Changsen Yuan, Heyan Huang, Chong Feng, Xiao Liu, Xiaochi Wei

TL;DR
This paper introduces a novel distant supervision approach for relation extraction that uses linear attenuation simulation and non-IID relevance embedding to better identify important words and relevant sentences, improving extraction accuracy.
Contribution
The paper presents a new method combining linear attenuation simulation and non-IID relevance embedding to enhance relation extraction in distant supervision tasks.
Findings
Improved accuracy on benchmark datasets
Effective capturing of complex word relations
Better relevance modeling within sentence bags
Abstract
Distant supervision for relation extraction is an efficient method to reduce labor costs and has been widely used to seek novel relational facts in large corpora, which can be identified as a multi-instance multi-label problem. However, existing distant supervision methods suffer from selecting important words in the sentence and extracting valid sentences in the bag. Towards this end, we propose a novel approach to address these problems in this paper. Firstly, we propose a linear attenuation simulation to reflect the importance of words in the sentence with respect to the distances between entities and words. Secondly, we propose a non-independent and identically distributed (non-IID) relevance embedding to capture the relevance of sentences in the bag. Our method can not only capture complex information of words about hidden relations, but also express the mutual information of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
