Beyond Known Facts: Generating Unseen Temporal Knowledge to Address Data Contamination in LLM Evaluation

Arthur Amalvy; Hen-Hsen Huang

arXiv:2601.13658·cs.CL·January 21, 2026

Beyond Known Facts: Generating Unseen Temporal Knowledge to Address Data Contamination in LLM Evaluation

Arthur Amalvy, Hen-Hsen Huang

PDF

Open Access

TL;DR

This paper introduces a synthetic, future-oriented dataset for temporal knowledge graph extraction to eliminate data contamination issues and provide unbiased benchmarking for LLMs.

Contribution

The authors propose a novel method to generate unseen future temporal facts and textual descriptions, creating a contamination-free dataset for robust evaluation of TKGE models.

Findings

01

LLMs perform worse on the new dataset compared to known facts.

02

The dataset contains 4.2K future quadruples and textual descriptions.

03

The methodology enables ongoing creation of unbiased temporal datasets.

Abstract

The automatic extraction of information is important for populating large web knowledge bases such as Wikidata. The temporal version of that task, temporal knowledge graph extraction (TKGE), involves extracting temporally grounded facts from text, represented as semantic quadruples (subject, relation, object, timestamp). Many recent systems take advantage of large language models (LLMs), which are becoming a new cornerstone of the web due to their performance on many tasks across the natural language processing (NLP) field. Despite the importance of TKGE, existing datasets for training and evaluation remain scarce, and contamination of evaluation data is an unaddressed issue, potentially inflating LLMs' perceived performance due to overlaps between training and evaluation sets. To mitigate these challenges, we propose a novel synthetic evaluation dataset constructed from predicted…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Semantic Web and Ontologies