A Decomposition Perspective to Long-context Reasoning for LLMs
Yanling Xiao, Huaibing Xie, Guoliang Zhao, Shihan Dou, Shaolei Wang, Yiting Liu, Nantao Zheng, Cheng Zhang, Pluto Zhou, Zhisong Zhang, Lemao Liu

TL;DR
This paper introduces a decomposition approach to long-context reasoning in LLMs, breaking down complex tasks into atomic skills, and uses reinforcement learning on targeted pseudo datasets to improve overall reasoning performance.
Contribution
It proposes decomposing long-context reasoning into atomic skills and synthesizing targeted pseudo datasets, then employs reinforcement learning to enhance these skills and boost reasoning ability.
Findings
Atomic skills strongly correlate with reasoning performance
Reinforcement learning on pseudo datasets improves accuracy by 7.7%
Outperforms baseline across multiple benchmarks
Abstract
Long-context reasoning is essential for complex real-world applications, yet remains a significant challenge for Large Language Models (LLMs). Despite the rapid evolution in long-context reasoning, current research often overlooks the internal complexity of the long-context reasoning task itself. In this paper, we move beyond this holistic view and decompose long-context reasoning into a set of fundamental atomic skills, and we then automatically synthesize a suite of pseudo datasets, each explicitly targeting a specific atomic skill. Our empirical analysis confirms that proficiency in these atomic skills is strongly correlated with general long-text reasoning performance. Building on this insight, we employ reinforcement learning on these pseudo datasets to sharpen the model's atomic skills, in the hope of boosting its general long-context reasoning ability. Extensive experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
