Improving Relation Extraction by Pre-trained Language Representations
Christoph Alt, Marc H\"ubner, Leonhard Hennig

TL;DR
This paper introduces TRE, a Transformer-based relation extraction model that leverages pre-trained language representations to improve accuracy, reduce reliance on explicit linguistic features, and enhance sample efficiency.
Contribution
TRE is the first to use pre-trained deep language models for relation extraction, achieving state-of-the-art results without explicit linguistic features.
Findings
Achieves new state-of-the-art F1 scores on TACRED and SemEval datasets.
Significantly reduces data requirements, matching baseline performance with only 20% of training data.
Demonstrates effective modeling of long-range dependencies in relation extraction.
Abstract
Current state-of-the-art relation extraction methods typically rely on a set of lexical, syntactic, and semantic features, explicitly computed in a pre-processing step. Training feature extraction models requires additional annotated language resources, which severely restricts the applicability and portability of relation extraction to novel languages. Similarly, pre-processing introduces an additional source of error. To address these limitations, we introduce TRE, a Transformer for Relation Extraction, extending the OpenAI Generative Pre-trained Transformer [Radford et al., 2018]. Unlike previous relation extraction models, TRE uses pre-trained deep language representations instead of explicit linguistic features to inform the relation classification and combines it with the self-attentive Transformer architecture to effectively model long-range dependencies between entity mentions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Softmax
