InfoDensity: Rewarding Information-Dense Traces for Efficient Reasoning

Chengwei Wei; Jung-jae Kim; Longyin Zhang; Shengkai Chen; Nancy F. Chen

arXiv:2603.17310·cs.AI·March 19, 2026

InfoDensity: Rewarding Information-Dense Traces for Efficient Reasoning

Chengwei Wei, Jung-jae Kim, Longyin Zhang, Shengkai Chen, Nancy F. Chen

PDF

Open Access

TL;DR

This paper introduces InfoDensity, a reward framework for reinforcement learning that encourages information-dense reasoning traces in large language models, leading to more efficient and accurate reasoning with less verbosity.

Contribution

It proposes a novel reward method combining entropy-based and monotonicity measures to improve reasoning quality and efficiency in LLMs.

Findings

01

InfoDensity achieves comparable or better accuracy than state-of-the-art methods.

02

Models trained with InfoDensity use significantly fewer tokens.

03

The approach promotes concise, high-quality reasoning traces.

Abstract

Large Language Models (LLMs) with extended reasoning capabilities often generate verbose and redundant reasoning traces, incurring unnecessary computational cost. While existing reinforcement learning approaches address this by optimizing final response length, they neglect the quality of intermediate reasoning steps, leaving models vulnerable to reward hacking. We argue that verbosity is not merely a length problem, but a symptom of poor intermediate reasoning quality. To investigate this, we conduct an empirical study tracking the conditional entropy of the answer distribution across reasoning steps. We find that high-quality reasoning traces exhibit two consistent properties: low uncertainty convergence and monotonic progress. These findings suggest that high-quality reasoning traces are informationally dense, that is, each step contributes meaningful entropy reduction relative to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques