CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced   Pre-Trained Language Models

Yusheng Su; Xu Han; Zhengyan Zhang; Peng Li; Zhiyuan Liu; Yankai Lin,; Jie Zhou; Maosong Sun

arXiv:2009.13964·cs.CL·April 6, 2023

CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models

Yusheng Su, Xu Han, Zhengyan Zhang, Peng Li, Zhiyuan Liu, Yankai Lin,, Jie Zhou, Maosong Sun

PDF

1 Repo

TL;DR

Coke is a novel framework that dynamically selects and embeds relevant knowledge from knowledge graphs based on textual context, improving language understanding and interpretability in pre-trained language models.

Contribution

It introduces a dynamic knowledge selection mechanism for PLMs, addressing limitations of static knowledge embedding and enhancing task performance.

Findings

01

Outperforms baselines on knowledge-driven NLP tasks

02

Improves interpretability of knowledge in language models

03

Demonstrates effectiveness of dynamic knowledge context

Abstract

Several recent efforts have been devoted to enhancing pre-trained language models (PLMs) by utilizing extra heterogeneous knowledge in knowledge graphs (KGs) and achieved consistent improvements on various knowledge-driven NLP tasks. However, most of these knowledge-enhanced PLMs embed static sub-graphs of KGs ("knowledge context"), regardless of that the knowledge required by PLMs may change dynamically according to specific text ("textual context"). In this paper, we propose a novel framework named Coke to dynamically select contextual knowledge and embed knowledge context according to textual context for PLMs, which can avoid the effect of redundant and ambiguous knowledge in KGs that cannot match the input text. Our experimental results show that Coke outperforms various baselines on typical knowledge-driven NLP tasks, indicating the effectiveness of utilizing dynamic knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thunlp/CokeBERT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.