Novel-WD: Exploring acquisition of Novel World Knowledge in LLMs Using Prefix-Tuning
Maxime M\'eloux, Christophe Cerisara

TL;DR
This paper introduces Novel-WD, a dataset for evaluating how well large language models can learn and remember new, recent world facts using prefix-tuning, and analyzes the capacity of prefixes to encode such information.
Contribution
The paper presents Novel-WD, a new dataset for assessing learning of recent facts, and demonstrates how prefix-tuning can effectively encode and store individual new facts in LLMs.
Findings
A single fact can be reliably encoded within one prefix.
Prefix capacity increases with prefix length and model size.
The dataset is freely available for community use.
Abstract
Teaching new information to pre-trained large language models (PLM) is a crucial but challenging task. Model adaptation techniques, such as fine-tuning and parameter-efficient training have been shown to store new facts at a slow rate; continual learning is an option but is costly and prone to catastrophic forgetting. This work studies and quantifies how PLM may learn and remember new world knowledge facts that do not occur in their pre-training corpus, which only contains world knowledge up to a certain date. To that purpose, we first propose Novel-WD, a new dataset consisting of sentences containing novel facts extracted from recent Wikidata updates, along with two evaluation tasks in the form of causal language modeling and multiple choice questions (MCQ). We make this dataset freely available to the community, and release a procedure to later build new versions of similar datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Mathematics, Computing, and Information Processing · Library Science and Information Systems
MethodsBalanced Selection
