RAG or Learning? Understanding the Limits of LLM Adaptation under Continuous Knowledge Drift in the Real World
Hanbing Liu, Lang Cao, Yang Li

TL;DR
This paper introduces a benchmark for evaluating how well large language models adapt to continuously changing real-world knowledge, revealing limitations of current methods and proposing a time-aware retrieval approach to improve temporal consistency.
Contribution
The work presents a new benchmark for real-world knowledge evolution and proposes Chronos, a time-aware retrieval method that enhances temporal consistency without extra training.
Findings
Most existing methods struggle with continuous knowledge drift.
Vanilla RAG and learning-based approaches face catastrophic forgetting.
Chronos improves temporal consistency in LLMs without additional training.
Abstract
Large language models (LLMs) acquire most of their knowledge during pretraining, which ties them to a fixed snapshot of the world and makes adaptation to continuously evolving knowledge challenging. As facts, entities, and events change over time, models may experience continuous knowledge drift, resulting not only in outdated predictions but also in temporally inconsistent reasoning. Although existing approaches, such as continual finetuning, knowledge editing, and retrieval-augmented generation (RAG), aim to update or supplement model knowledge, they are rarely evaluated in settings that reflect chronological, evolving, and real-world knowledge evolution. In this work, we introduce a new benchmark of real-world dynamic events, constructed from time-stamped evidence that captures how knowledge evolves over time, which enables systematic evaluation of model adaptation under continuous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
