KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation
Xiaozhi Wang, Tianyu Gao, Zhaocheng Zhu, Zhengyan Zhang, Zhiyuan Liu,, Juanzi Li, Jian Tang

TL;DR
KEPLER is a unified model that integrates knowledge embedding with pre-trained language models, enhancing factual knowledge representation and text-based KE, achieving state-of-the-art results in NLP tasks and KG link prediction.
Contribution
This paper introduces KEPLER, a novel unified model combining KE and PLMs, and provides a large-scale KG dataset Wikidata5M for benchmarking and research.
Findings
Achieves state-of-the-art performance on various NLP tasks.
Works effectively as an inductive KE model for KG link prediction.
Provides a new large-scale KG benchmark dataset, Wikidata5M.
Abstract
Pre-trained language representation models (PLMs) cannot well capture factual knowledge from text. In contrast, knowledge embedding (KE) methods can effectively represent the relational facts in knowledge graphs (KGs) with informative entity embeddings, but conventional KE models cannot take full advantage of the abundant textual information. In this paper, we propose a unified model for Knowledge Embedding and Pre-trained LanguagE Representation (KEPLER), which can not only better integrate factual knowledge into PLMs but also produce effective text-enhanced KE with the strong PLMs. In KEPLER, we encode textual entity descriptions with a PLM as their embeddings, and then jointly optimize the KE and language modeling objectives. Experimental results show that KEPLER achieves state-of-the-art performances on various NLP tasks, and also works remarkably well as an inductive KE model on KG…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Natural Language Processing Techniques
