Beyond Known Facts: Generating Unseen Temporal Knowledge to Address Data Contamination in LLM Evaluation
Arthur Amalvy, Hen-Hsen Huang

TL;DR
This paper introduces a synthetic, future-oriented dataset for temporal knowledge graph extraction to eliminate data contamination issues and provide unbiased benchmarking for LLMs.
Contribution
The authors propose a novel method to generate unseen future temporal facts and textual descriptions, creating a contamination-free dataset for robust evaluation of TKGE models.
Findings
LLMs perform worse on the new dataset compared to known facts.
The dataset contains 4.2K future quadruples and textual descriptions.
The methodology enables ongoing creation of unbiased temporal datasets.
Abstract
The automatic extraction of information is important for populating large web knowledge bases such as Wikidata. The temporal version of that task, temporal knowledge graph extraction (TKGE), involves extracting temporally grounded facts from text, represented as semantic quadruples (subject, relation, object, timestamp). Many recent systems take advantage of large language models (LLMs), which are becoming a new cornerstone of the web due to their performance on many tasks across the natural language processing (NLP) field. Despite the importance of TKGE, existing datasets for training and evaluation remain scarce, and contamination of evaluation data is an unaddressed issue, potentially inflating LLMs' perceived performance due to overlaps between training and evaluation sets. To mitigate these challenges, we propose a novel synthetic evaluation dataset constructed from predicted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Semantic Web and Ontologies
