Understanding When Tree of Thoughts Succeeds: Larger Models Excel in Generation, Not Discrimination
Qiqi Chen, Xinpeng Wang, Philipp Mondorf, Michael A. Hedderich,, Barbara Plank

TL;DR
This paper investigates the Tree of Thoughts reasoning strategy for Large Language Models, revealing that larger models improve generation quality more than discrimination, and the generator's role is more crucial than the discriminator's.
Contribution
The study disentangles the roles of generator and discriminator in ToT, highlighting the importance of generator scaling for improved reasoning performance.
Findings
Scaling the generator significantly boosts ToT success.
Discriminator scaling yields marginal gains.
Models have similar discrimination but varied generative capabilities.
Abstract
Tree of Thoughts (ToT) is a reasoning strategy for Large Language Models (LLMs) that employs a generator to suggest reasoning steps and a discriminator to decide which steps to implement. ToT demonstrates strong performance on reasoning tasks, often surpassing simple methods such as Input-Output (IO) prompting and Chain-of-Thought (CoT) reasoning. However, ToT does not consistently outperform such simpler methods across all models, leaving large knowledge gaps on the conditions under which ToT is most beneficial. In this paper, we analyze the roles of the generator and discriminator separately to better understand the conditions when ToT is beneficial. We find that the generator plays a more critical role than the discriminator in driving the success of ToT. Scaling the generator leads to notable improvements in ToT performance, even when using a smaller model as the discriminator,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance · Spreadsheets and End-User Computing · Transportation and Mobility Innovations
