Hybrid Attention-Based Transformer Block Model for Distant Supervision Relation Extraction
Yan Xiao, Yaochu Jin, Ran Cheng, Kuangrong Hao

TL;DR
This paper introduces a hybrid attention-based Transformer model combined with multi-instance learning to improve distant supervision relation extraction, effectively reducing wrong labels and outperforming existing methods on benchmark datasets.
Contribution
The paper proposes a novel Transformer-based framework with sentence-level attention for DSRE, enhancing relation extraction accuracy by addressing label noise.
Findings
Outperforms state-of-the-art algorithms on NYT dataset
Effectively captures syntactic and semantic features of sentences
Reduces wrong labeling issues in distant supervision
Abstract
With an exponential explosive growth of various digital text information, it is challenging to efficiently obtain specific knowledge from massive unstructured text information. As one basic task for natural language processing (NLP), relation extraction aims to extract the semantic relation between entity pairs based on the given text. To avoid manual labeling of datasets, distant supervision relation extraction (DSRE) has been widely used, aiming to utilize knowledge base to automatically annotate datasets. Unfortunately, this method heavily suffers from wrong labelling due to the underlying strong assumptions. To address this issue, we propose a new framework using hybrid attention-based Transformer block with multi-instance learning to perform the DSRE task. More specifically, the Transformer block is firstly used as the sentence encoder to capture syntactic information of sentences,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Softmax
