Policy-Guided Search on Tree-of-Thoughts for Efficient Problem Solving with Bounded Language Model Queries
Sumedh Pendurkar, Guni Sharon

TL;DR
This paper introduces a policy-guided search method using Levin Tree Search within the Tree-of-Thoughts framework to improve problem-solving efficiency of language models under limited computational budgets, with theoretical guarantees and empirical validation.
Contribution
It adapts Levin Tree Search to the Tree-of-Thoughts framework, providing theoretical bounds and demonstrating improved efficiency and accuracy in resource-constrained problem-solving tasks.
Findings
LTS achieves comparable or higher accuracy than baseline methods.
LTS reduces the number of thought evaluations needed.
Theoretical bounds on the number of states expanded are established.
Abstract
Recent studies explored integrating state-space search algorithms with Language Models (LM) to perform look-ahead on the token generation process, the ''Tree-of-Thoughts'' (ToT), generated by LMs, thereby improving performance on problem-solving tasks. However, the affiliated search algorithms often overlook the significant computational costs associated with LM inference, particularly in scenarios with constrained computational budgets. Consequently, we address the problem of improving LM performance on problem-solving tasks under limited computational budgets. We demonstrate how the probabilities assigned to thoughts by LMs can serve as a heuristic to guide search within the ToT framework, thereby reducing the number of thought evaluations. Building on this insight, we adapt a heuristic search algorithm, Levin Tree Search (LTS), to the ToT framework, which leverages LMs as policies to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Big Data and Digital Economy · Machine Learning and Algorithms
