"Lost-in-the-Later": Framework for Quantifying Contextual Grounding in Large Language Models

Yufei Tao; Adam Hiatt; Rahul Seetharaman; Ameeta Agrawal

arXiv:2507.05424·cs.CL·July 9, 2025

"Lost-in-the-Later": Framework for Quantifying Contextual Grounding in Large Language Models

Yufei Tao, Adam Hiatt, Rahul Seetharaman, Ameeta Agrawal

PDF

Open Access

TL;DR

This paper introduces CoPE, a framework for evaluating how large language models prioritize contextual versus parametric knowledge, revealing a positional bias called 'lost-in-the-later' that impacts their ability to utilize context effectively.

Contribution

The paper presents CoPE, a novel evaluation framework and dataset for systematically measuring contextual and parametric knowledge in LLMs across multiple languages, and uncovers the lost-in-the-later bias affecting contextual grounding.

Findings

01

LLMs tend to overlook information appearing later in context.

02

Chain-of-thought prompting reduces contextual recall and degrades grounding.

03

Prompt-based methods leveraging context improve factual accuracy in summarization.

Abstract

Large language models are capable of leveraging both contextual and parametric knowledge but how they prioritize and integrate these sources remains underexplored. We introduce CoPE, a novel evaluation framework that systematically measures contextual knowledge (CK) and parametric knowledge (PK) across models and languages. Using our MultiWikiAtomic dataset in English, Spanish, and Danish, we analyze how large language models (LLMs) integrate context, prioritize information, and incorporate PK in open-ended question answering. Our analysis uncovers a phenomenon we call lost-in-the-later, where LLMs tend to overlook or deprioritize information that appears later in a given context, revealing a strong positional bias that affects contextual grounding. We further find that reasoning models, as well as non-reasoning models prompted with chain-of-thought (CoT), use context even less than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Neurobiology of Language and Bilingualism