LM-CORE: Language Models with Contextually Relevant External Knowledge

Jivat Neet Kaur; Sumit Bhatia; Milan Aggarwal; Rachit Bansal; and Balaji Krishnamurthy

arXiv:2208.06458·cs.CL·August 16, 2022

LM-CORE: Language Models with Contextually Relevant External Knowledge

Jivat Neet Kaur, Sumit Bhatia, Milan Aggarwal, Rachit Bansal, and Balaji Krishnamurthy

PDF

Open Access 1 Repo

TL;DR

LM-CORE introduces a framework that enables language models to access and utilize external structured knowledge dynamically, improving performance on knowledge tasks and allowing knowledge updates without retraining the entire model.

Contribution

The paper proposes LM-CORE, a novel framework that decouples language model training from external knowledge sources, facilitating efficient knowledge integration and updates.

Findings

01

LM-CORE outperforms state-of-the-art knowledge-enhanced models on probing tasks.

02

It effectively handles knowledge updates without retraining.

03

Performs well on downstream tasks.

Abstract

Large transformer-based pre-trained language models have achieved impressive performance on a variety of knowledge-intensive tasks and can capture factual knowledge in their parameters. We argue that storing large amounts of knowledge in the model parameters is sub-optimal given the ever-growing amounts of knowledge and resource requirements. We posit that a more efficient alternative is to provide explicit access to contextually relevant structured knowledge to the model and train it to use that knowledge. We present LM-CORE -- a general framework to achieve this -- that allows \textit{decoupling} of the language model training from the external knowledge source and allows the latter to be updated without affecting the already trained model. Experimental results show that LM-CORE, having access to external knowledge, achieves significant and robust outperformance over state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sumit-research/lmcore
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications