A Decomposition Perspective to Long-context Reasoning for LLMs

Yanling Xiao; Huaibing Xie; Guoliang Zhao; Shihan Dou; Shaolei Wang; Yiting Liu; Nantao Zheng; Cheng Zhang; Pluto Zhou; Zhisong Zhang; Lemao Liu

arXiv:2604.07981·cs.CL·April 10, 2026

A Decomposition Perspective to Long-context Reasoning for LLMs

Yanling Xiao, Huaibing Xie, Guoliang Zhao, Shihan Dou, Shaolei Wang, Yiting Liu, Nantao Zheng, Cheng Zhang, Pluto Zhou, Zhisong Zhang, Lemao Liu

PDF

TL;DR

This paper introduces a decomposition approach to long-context reasoning in LLMs, breaking down complex tasks into atomic skills, and uses reinforcement learning on targeted pseudo datasets to improve overall reasoning performance.

Contribution

It proposes decomposing long-context reasoning into atomic skills and synthesizing targeted pseudo datasets, then employs reinforcement learning to enhance these skills and boost reasoning ability.

Findings

01

Atomic skills strongly correlate with reasoning performance

02

Reinforcement learning on pseudo datasets improves accuracy by 7.7%

03

Outperforms baseline across multiple benchmarks

Abstract

Long-context reasoning is essential for complex real-world applications, yet remains a significant challenge for Large Language Models (LLMs). Despite the rapid evolution in long-context reasoning, current research often overlooks the internal complexity of the long-context reasoning task itself. In this paper, we move beyond this holistic view and decompose long-context reasoning into a set of fundamental atomic skills, and we then automatically synthesize a suite of pseudo datasets, each explicitly targeting a specific atomic skill. Our empirical analysis confirms that proficiency in these atomic skills is strongly correlated with general long-text reasoning performance. Building on this insight, we employ reinforcement learning on these pseudo datasets to sharpen the model's atomic skills, in the hope of boosting its general long-context reasoning ability. Extensive experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.