Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

Omer Goldman; Alon Jacovi; Aviv Slobodkin; Aviya Maimon; Ido Dagan; Reut Tsarfaty

arXiv:2407.00402·cs.CL·July 8, 2025

Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

Omer Goldman, Alon Jacovi, Aviv Slobodkin, Aviya Maimon, Ido Dagan, Reut Tsarfaty

PDF

Open Access 1 Video

TL;DR

This paper argues that long-context NLP tasks are diverse and should be categorized by difficulty properties like information diffusion and scope, to improve research focus and benchmark design.

Contribution

It introduces a taxonomy of long-context tasks based on difficulty axes and advocates for more precise task definitions and benchmarks.

Findings

01

Long-context tasks vary significantly in difficulty.

02

A taxonomy based on diffusion and scope clarifies task complexity.

03

Under-explored areas involve highly diffused, lengthy necessary information.

Abstract

Improvements in language models' capabilities have pushed their applications towards longer contexts, making long-context evaluation and development an active research area. However, many disparate use-cases are grouped together under the umbrella term of "long-context", defined simply by the total length of the model's input, including - for example - Needle-in-a-Haystack tasks, book summarization, and information aggregation. Given their varied difficulty, in this position paper we argue that conflating different tasks by their context length is unproductive. As a community, we require a more precise vocabulary to understand what makes long-context tasks similar or different. We propose to unpack the taxonomy of long-context based on the properties that make them more difficult with longer contexts. We propose two orthogonal axes of difficulty: (I) Diffusion: How hard is it to find…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies