Reasoning about Reasoning: BAPO Bounds on Chain-of-Thought Token Complexity in LLMs
Kiran Tomlinson, Tobias Schnabel, Adith Swaminathan, Jennifer Neville

TL;DR
This paper establishes fundamental lower bounds on the number of reasoning tokens needed for chain-of-thought in large language models, revealing linear scaling requirements and bottlenecks in inference-time compute.
Contribution
It extends the BAPO model to prove linear lower bounds on reasoning tokens for key tasks, providing theoretical insights into CoT complexity and empirical validation.
Findings
Reasoning tokens scale linearly with input size in key tasks.
Models fail when reasoning budgets are too small, aligning with theoretical bounds.
The study offers a framework to analyze optimal reasoning length in LLMs.
Abstract
Inference-time scaling via chain-of-thought (CoT) reasoning is a major driver of state-of-the-art LLM performance, but it comes with substantial latency and compute costs. We address a fundamental theoretical question: how many reasoning tokens are required to solve a problem as input size grows? By extending the bounded attention prefix oracle (BAPO) model--an abstraction of LLMs that quantifies the information flow required to solve a task--we prove lower bounds on the CoT tokens required for three canonical BAPO-hard tasks: binary majority, triplet matching, and graph reachability. We show that each requires reasoning tokens when the input size is . We complement these results with matching or near-matching upper bounds via explicit constructions. Finally, our experiments with frontier reasoning models show approximately linear reasoning token scaling on these tasks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning in Healthcare · Advanced Memory and Neural Computing
