Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning

Wang Yang; Zirui Liu; Hongye Jin; Qingyu Yin; Vipin Chaudhary; Xiaotian Han

arXiv:2505.17315·cs.AI·May 26, 2025

Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning

Wang Yang, Zirui Liu, Hongye Jin, Qingyu Yin, Vipin Chaudhary, Xiaotian Han

PDF

TL;DR

This paper investigates how enhancing long-context capacity in language models improves reasoning abilities, demonstrating that stronger long-context modeling leads to better performance across reasoning benchmarks, even with short inputs.

Contribution

It provides empirical evidence that increasing long-context capacity before fine-tuning enhances reasoning performance, highlighting long-context ability as a key factor in model design.

Findings

01

Models with stronger long-context capacity outperform others in reasoning benchmarks.

02

Long-context training benefits general reasoning performance, even on short input tasks.

03

Enhancing long-context ability is crucial for developing more capable reasoning models.

Abstract

Recent language models exhibit strong reasoning capabilities, yet the influence of long-context capacity on reasoning remains underexplored. In this work, we hypothesize that current limitations in reasoning stem, in part, from insufficient long-context capacity, motivated by empirical observations such as (1) higher context window length often leads to stronger reasoning performance, and (2) failed reasoning cases resemble failed long-context cases. To test this hypothesis, we examine whether enhancing a model's long-context ability before Supervised Fine-Tuning (SFT) leads to improved reasoning performance. Specifically, we compared models with identical architectures and fine-tuning data but varying levels of long-context capacity. Our results reveal a consistent trend: models with stronger long-context capacity achieve significantly higher accuracy on reasoning benchmarks after SFT.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsShrink and Fine-Tune