A Survey on Knowledge-Enhanced Pre-trained Language Models

Chaoqi Zhen; Yanlei Shang; Xiangyu Liu; Yifei Li; Yong; Chen; Dell Zhang

arXiv:2212.13428·cs.CL·December 29, 2022·5 cites

A Survey on Knowledge-Enhanced Pre-trained Language Models

Chaoqi Zhen, Yanlei Shang, Xiangyu Liu, Yifei Li, Yong, Chen, Dell Zhang

PDF

Open Access

TL;DR

This survey reviews the integration of external knowledge into pre-trained language models, highlighting methods, applications, challenges, and future directions to enhance NLP performance and interpretability.

Contribution

It provides a comprehensive overview of knowledge-enhanced pre-trained language models, including knowledge types, integration methods, evaluation, and applications, summarizing recent advancements in the field.

Findings

01

KEPLMs improve reasoning and interpretability.

02

Various knowledge integration methods exist and are effective.

03

Future research directions include better knowledge integration techniques.

Abstract

Natural Language Processing (NLP) has been revolutionized by the use of Pre-trained Language Models (PLMs) such as BERT. Despite setting new records in nearly every NLP task, PLMs still face a number of challenges including poor interpretability, weak reasoning capability, and the need for a lot of expensive annotated data when applied to downstream tasks. By integrating external knowledge into PLMs, \textit{\underline{K}nowledge-\underline{E}nhanced \underline{P}re-trained \underline{L}anguage \underline{M}odels} (KEPLMs) have the potential to overcome the above-mentioned limitations. In this paper, we examine KEPLMs systematically through a series of studies. Specifically, we outline the common types and different formats of knowledge to be integrated into KEPLMs, detail the existing methods for building and evaluating KEPLMS, present the applications of KEPLMs in downstream tasks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Weight Decay · WordPiece · Dense Connections · Linear Warmup With Linear Decay · Layer Normalization