ChroKnowledge: Unveiling Chronological Knowledge of Language Models in   Multiple Domains

Yein Park; Chanwoong Yoon; Jungwoo Park; Donghyeon Lee; Minbyul Jeong,; Jaewoo Kang

arXiv:2410.09870·cs.CL·March 3, 2025

ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains

Yein Park, Chanwoong Yoon, Jungwoo Park, Donghyeon Lee, Minbyul Jeong,, Jaewoo Kang

PDF

Open Access 1 Repo 1 Datasets 1 Video

TL;DR

This paper introduces ChroKnowBench, a benchmark dataset and evaluation framework for assessing the chronological knowledge of language models across multiple domains and temporal aspects, highlighting their strengths and limitations.

Contribution

It presents a novel benchmark and sampling-based framework to evaluate LLMs' ability to recall and adapt to evolving and static knowledge over time.

Findings

01

LLMs' temporal knowledge recall varies with training data format.

02

Models partially recall knowledge and often cut off at temporal boundaries.

03

The proposed prompting method improves knowledge elicitation across different LLMs.

Abstract

Large language models (LLMs) have brought significant changes to many aspects of our lives. However, assessing and ensuring their chronological knowledge remains challenging. Existing approaches fall short in addressing the temporal adaptability of knowledge, often relying on a fixed time-point view. To overcome this, we introduce ChroKnowBench, a benchmark dataset designed to evaluate chronologically accumulated knowledge across three key aspects: multiple domains, time dependency, temporal state. Our benchmark distinguishes between knowledge that evolves (e.g., personal history, scientific discoveries, amended laws) and knowledge that remain constant (e.g., mathematical truths, commonsense facts). Building on this benchmark, we present ChroKnowledge (Chronological Categorization of Knowledge), a novel sampling-based framework for evaluating LLMs' non-parametric chronological…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dmis-lab/chroknowledge
pytorchOfficial

Datasets

dmis-lab/ChroKnowBench
dataset· 95 dl
95 dl

Videos

ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains· slideslive

Taxonomy

TopicsNatural Language Processing Techniques