Time-Aware Language Models as Temporal Knowledge Bases

Bhuwan Dhingra; Jeremy R. Cole; Julian Martin Eisenschlos; Daniel; Gillick; Jacob Eisenstein; William W. Cohen

arXiv:2106.15110·cs.CL·April 26, 2022

Time-Aware Language Models as Temporal Knowledge Bases

Bhuwan Dhingra, Jeremy R. Cole, Julian Martin Eisenschlos, Daniel, Gillick, Jacob Eisenstein, William W. Cohen

PDF

TL;DR

This paper introduces a diagnostic dataset and a temporal modeling technique for language models, enabling them to better handle time-sensitive facts and be efficiently updated with new data.

Contribution

It proposes a simple method to incorporate timestamps into language models, improving factual accuracy over time and allowing efficient model updates without retraining.

Findings

01

Temporal modeling improves factual accuracy for time-sensitive information.

02

Models trained with timestamps better calibrate predictions about future facts.

03

The approach enables efficient model updates as new data becomes available.

Abstract

Many facts come with an expiration date, from the name of the President to the basketball team Lebron James plays for. But language models (LMs) are trained on snapshots of data collected at a specific moment in time, and this can limit their utility, especially in the closed-book setting where the pretraining corpus must contain the facts the model should memorize. We introduce a diagnostic dataset aimed at probing LMs for factual knowledge that changes over time and highlight problems with LMs at either end of the spectrum -- those trained on specific slices of temporal data, as well as those trained on a wide range of temporal data. To mitigate these problems, we propose a simple technique for jointly modeling text with its timestamp. This improves memorization of seen facts from the training time period, as well as calibration on predictions about unseen facts from future time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.