LongR: Unleashing Long-Context Reasoning via Reinforcement Learning with Dense Utility Rewards

Bowen Ping; Zijun Chen; Yiyao Yu; Tingfeng Hui; Junchi Yan; Baobao Chang

arXiv:2602.05758·cs.CL·February 6, 2026

LongR: Unleashing Long-Context Reasoning via Reinforcement Learning with Dense Utility Rewards

Bowen Ping, Zijun Chen, Yiyao Yu, Tingfeng Hui, Junchi Yan, Baobao Chang

PDF

Open Access

TL;DR

LongR introduces a reinforcement learning framework that improves long-context reasoning in language models by using dense utility rewards and a dynamic reasoning mechanism, leading to significant performance gains.

Contribution

The paper presents LongR, a novel RL-based approach that integrates a reasoning mechanism with dense utility rewards to enhance long-context reasoning capabilities.

Findings

01

Achieves 9% improvement on LongBench v2

02

Consistently improves performance across multiple RL algorithms

03

Enhances robustness against distractors in reasoning tasks

Abstract

Reinforcement Learning has emerged as a key driver for LLM reasoning. This capability is equally pivotal in long-context scenarios--such as long-dialogue understanding and structured data analysis, where the challenge extends beyond consuming tokens to performing rigorous deduction. While existing efforts focus on data synthesis or architectural changes, recent work points out that relying solely on sparse, outcome-only rewards yields limited gains, as such coarse signals are often insufficient to effectively guide the complex long-context reasoning. To address this, we propose LongR, a unified framework that enhances long-context performance by integrating a dynamic "Think-and-Read" mechanism, which interleaves reasoning with document consultation, with a contextual density reward based on relative information gain to quantify the utility of the relevant documents. Empirically, LongR…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Advanced Graph Neural Networks