A Survey of Knowledge Enhanced Pre-trained Models

Jian Yang; Xinyu Hu; Gang Xiao; Yulong Shen

arXiv:2110.00269·cs.CL·October 31, 2023·23 cites

A Survey of Knowledge Enhanced Pre-trained Models

Jian Yang, Xinyu Hu, Gang Xiao, Yulong Shen

PDF

Open Access

TL;DR

This survey reviews knowledge-enhanced pre-trained language models (KEPLMs) in NLP, highlighting their advancements, categorization, and potential future research directions to improve robustness, interpretability, and reasoning capabilities.

Contribution

It provides a comprehensive overview and systematic categorization of KEPLMs, emphasizing their role in enhancing understanding and interpretability in NLP.

Findings

01

KEPLMs improve model interpretability and reasoning.

02

Categorization of KEPLMs based on knowledge integration methods.

03

Identification of future research directions for KEPLMs.

Abstract

Pre-trained language models learn informative word representations on a large-scale text corpus through self-supervised learning, which has achieved promising performance in fields of natural language processing (NLP) after fine-tuning. These models, however, suffer from poor robustness and lack of interpretability. We refer to pre-trained language models with knowledge injection as knowledge-enhanced pre-trained language models (KEPLMs). These models demonstrate deep understanding and logical reasoning and introduce interpretability. In this survey, we provide a comprehensive overview of KEPLMs in NLP. We first discuss the advancements in pre-trained language models and knowledge representation learning. Then we systematically categorize existing KEPLMs from three different perspectives. Finally, we outline some potential directions of KEPLMs for future research.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications