A Survey on Knowledge-Enhanced Pre-trained Language Models
Chaoqi Zhen, Yanlei Shang, Xiangyu Liu, Yifei Li, Yong, Chen, Dell Zhang

TL;DR
This survey reviews the integration of external knowledge into pre-trained language models, highlighting methods, applications, challenges, and future directions to enhance NLP performance and interpretability.
Contribution
It provides a comprehensive overview of knowledge-enhanced pre-trained language models, including knowledge types, integration methods, evaluation, and applications, summarizing recent advancements in the field.
Findings
KEPLMs improve reasoning and interpretability.
Various knowledge integration methods exist and are effective.
Future research directions include better knowledge integration techniques.
Abstract
Natural Language Processing (NLP) has been revolutionized by the use of Pre-trained Language Models (PLMs) such as BERT. Despite setting new records in nearly every NLP task, PLMs still face a number of challenges including poor interpretability, weak reasoning capability, and the need for a lot of expensive annotated data when applied to downstream tasks. By integrating external knowledge into PLMs, \textit{\underline{K}nowledge-\underline{E}nhanced \underline{P}re-trained \underline{L}anguage \underline{M}odels} (KEPLMs) have the potential to overcome the above-mentioned limitations. In this paper, we examine KEPLMs systematically through a series of studies. Specifically, we outline the common types and different formats of knowledge to be integrated into KEPLMs, detail the existing methods for building and evaluating KEPLMS, present the applications of KEPLMs in downstream tasks,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Weight Decay · WordPiece · Dense Connections · Linear Warmup With Linear Decay · Layer Normalization
