ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains
Yein Park, Chanwoong Yoon, Jungwoo Park, Donghyeon Lee, Minbyul Jeong,, Jaewoo Kang

TL;DR
This paper introduces ChroKnowBench, a benchmark dataset and evaluation framework for assessing the chronological knowledge of language models across multiple domains and temporal aspects, highlighting their strengths and limitations.
Contribution
It presents a novel benchmark and sampling-based framework to evaluate LLMs' ability to recall and adapt to evolving and static knowledge over time.
Findings
LLMs' temporal knowledge recall varies with training data format.
Models partially recall knowledge and often cut off at temporal boundaries.
The proposed prompting method improves knowledge elicitation across different LLMs.
Abstract
Large language models (LLMs) have brought significant changes to many aspects of our lives. However, assessing and ensuring their chronological knowledge remains challenging. Existing approaches fall short in addressing the temporal adaptability of knowledge, often relying on a fixed time-point view. To overcome this, we introduce ChroKnowBench, a benchmark dataset designed to evaluate chronologically accumulated knowledge across three key aspects: multiple domains, time dependency, temporal state. Our benchmark distinguishes between knowledge that evolves (e.g., personal history, scientific discoveries, amended laws) and knowledge that remain constant (e.g., mathematical truths, commonsense facts). Building on this benchmark, we present ChroKnowledge (Chronological Categorization of Knowledge), a novel sampling-based framework for evaluating LLMs' non-parametric chronological…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques
