Your Model Diversity, Not Method, Determines Reasoning Strategy
Moulik Choraria, Argyrios Gerogiannis, Anirban Das, Supriyo Chakraborty, Berkcan Kapusuzoglu, Chia-Hsuan Lee, Kartik Balasubramaniam, Shi-Xiong Zhang, Sambit Sahu

TL;DR
This paper argues that the effectiveness of reasoning strategies in large language models depends on the models' diversity profile, emphasizing the importance of model diversity over the specific method used.
Contribution
It introduces a theoretical framework linking model diversity to reasoning strategy effectiveness and validates it across different model families.
Findings
Depth refinement works well for low-diversity models with lightweight signals.
High-diversity models require stronger signals for effective depth-based refinement.
Model diversity profile influences the choice of reasoning exploration strategies.
Abstract
Compute scaling for LLM reasoning requires allocating budget between exploring solution approaches () and refining promising solutions (). Most methods implicitly trade off one for the other, yet why a given trade-off works remains unclear, and validation on a single model obscures the role of the model itself. We argue that We formalize this through a theoretical framework decomposing reasoning uncertainty and derive conditions under which tree-style depth refinement outperforms parallel sampling. We validate it on Qwen-3 4B and Olmo-3 7B families, showing that lightweight signals suffice for depth-based refinement on low-diversity aligned models while yielding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
