Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck
Fabio Valerio Massoli, Andrey Kuzmin, Arash Behboodi

TL;DR
This paper introduces a novel framework based on the Conditional Information Bottleneck (CIB) principle to optimize reasoning in large language models by balancing accuracy and token efficiency through semantic priors.
Contribution
It formulates reasoning as a compression problem under CIB, addressing attention violations and proposing a semantic prior for token cost, improving reasoning efficiency without sacrificing accuracy.
Findings
CIB prunes reasoning redundancy effectively.
Semantic prior improves token compression with minimal accuracy loss.
Framework generalizes across models and tasks.
Abstract
\ac{CoT} prompting improves LLM accuracy on complex tasks but often increases token usage and inference cost. Existing ``Budget Forcing'' methods reduce cost via fine-tuning with heuristic length penalties, suppressing both essential reasoning and redundant filler. We recast efficient reasoning as a lossy compression problem under the \ac{IB} principle, and identify a key theoretical gap when applying naive \ac{IB} to transformers: attention violates the Markov property between prompt, reasoning trace, and response. To resolve this issue, we model \ac{CoT} generation under the \ac{CIB} principle, where the reasoning trace acts as a computational bridge that contains only the information about the response that is not directly accessible from the prompt . This yields a general Reinforcement Learning objective: maximize task reward while compressing completions under a prior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
