SPOT: Knowledge-Enhanced Language Representations for Information   Extraction

Jiacheng Li; Yannis Katsis; Tyler Baldwin; Ho-Cheol Kim; Andrew; Bartko; Julian McAuley; Chun-Nan Hsu

arXiv:2208.09625·cs.CL·October 25, 2022·1 cites

SPOT: Knowledge-Enhanced Language Representations for Information Extraction

Jiacheng Li, Yannis Katsis, Tyler Baldwin, Ho-Cheol Kim, Andrew, Bartko, Julian McAuley, Chun-Nan Hsu

PDF

Open Access

TL;DR

This paper introduces SPOT, a knowledge-enhanced language model that efficiently learns representations of entities and relationships from text spans, improving information extraction tasks with fewer parameters.

Contribution

The paper proposes a novel span-based pre-trained model that jointly encodes entities and relationships, addressing limitations of previous models in handling out-of-vocabulary entities and parameter efficiency.

Findings

01

Outperforms baselines in representing entities and relationships.

02

Fine-tuning surpasses RoBERTa in supervised information extraction tasks.

03

Achieves competitive results across various extraction benchmarks.

Abstract

Knowledge-enhanced pre-trained models for language representation have been shown to be more effective in knowledge base construction tasks (i.e.,~relation extraction) than language models such as BERT. These knowledge-enhanced language models incorporate knowledge into pre-training to generate representations of entities or relationships. However, existing methods typically represent each entity with a separate embedding. As a result, these methods struggle to represent out-of-vocabulary entities and a large amount of parameters, on top of their underlying token models (i.e.,~the transformer), must be used and the number of entities that can be handled is limited in practice due to memory constraints. Moreover, existing models still struggle to represent entities and relationships simultaneously. To address these problems, we propose a new pre-trained model that learns representations…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsMulti-Head Attention · Attention Is All You Need · Test · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection · WordPiece · Linear Warmup With Linear Decay · Attention Dropout · Dropout