Can LMs Learn New Entities from Descriptions? Challenges in Propagating   Injected Knowledge

Yasumasa Onoe; Michael J.Q. Zhang; Shankar Padmanabhan; Greg Durrett,; Eunsol Choi

arXiv:2305.01651·cs.CL·May 3, 2023·1 cites

Can LMs Learn New Entities from Descriptions? Challenges in Propagating Injected Knowledge

Yasumasa Onoe, Michael J.Q. Zhang, Shankar Padmanabhan, Greg Durrett,, Eunsol Choi

PDF

Open Access 1 Repo

TL;DR

This paper investigates whether pre-trained language models can effectively propagate injected knowledge about entities through inference, revealing limitations of current update methods and potential improvements via context prepending.

Contribution

The study introduces a new benchmark for testing knowledge propagation in LMs and compares different injection methods, highlighting the effectiveness of context prepending over traditional fine-tuning.

Findings

01

Existing update methods show limited knowledge propagation.

02

Performance improves with lexical overlap between facts and inferences.

03

Context prepending significantly enhances knowledge inference across settings.

Abstract

Pre-trained language models (LMs) are used for knowledge intensive tasks like question answering, but their knowledge gets continuously outdated as the world changes. Prior work has studied targeted updates to LMs, injecting individual facts and evaluating whether the model learns these facts while not changing predictions on other contexts. We take a step forward and study LMs' abilities to make inferences based on injected facts (or propagate those facts): for example, after learning that something is a TV show, does an LM predict that you can watch it? We study this with two cloze-style tasks: an existing dataset of real-world sentences about novel entities (ECBD) as well as a new controlled benchmark with manually designed templates requiring varying levels of inference about injected knowledge. Surprisingly, we find that existing methods for updating knowledge (gradient-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yasumasaonoe/entity_knowledge_propagation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications