SPOT: Knowledge-Enhanced Language Representations for Information Extraction
Jiacheng Li, Yannis Katsis, Tyler Baldwin, Ho-Cheol Kim, Andrew, Bartko, Julian McAuley, Chun-Nan Hsu

TL;DR
This paper introduces SPOT, a knowledge-enhanced language model that efficiently learns representations of entities and relationships from text spans, improving information extraction tasks with fewer parameters.
Contribution
The paper proposes a novel span-based pre-trained model that jointly encodes entities and relationships, addressing limitations of previous models in handling out-of-vocabulary entities and parameter efficiency.
Findings
Outperforms baselines in representing entities and relationships.
Fine-tuning surpasses RoBERTa in supervised information extraction tasks.
Achieves competitive results across various extraction benchmarks.
Abstract
Knowledge-enhanced pre-trained models for language representation have been shown to be more effective in knowledge base construction tasks (i.e.,~relation extraction) than language models such as BERT. These knowledge-enhanced language models incorporate knowledge into pre-training to generate representations of entities or relationships. However, existing methods typically represent each entity with a separate embedding. As a result, these methods struggle to represent out-of-vocabulary entities and a large amount of parameters, on top of their underlying token models (i.e.,~the transformer), must be used and the number of entities that can be handled is limited in practice due to memory constraints. Moreover, existing models still struggle to represent entities and relationships simultaneously. To address these problems, we propose a new pre-trained model that learns representations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsMulti-Head Attention · Attention Is All You Need · Test · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection · WordPiece · Linear Warmup With Linear Decay · Attention Dropout · Dropout
