Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Thomas L. Griffiths,, Yuan Cao, Karthik Narasimhan

TL;DR
The paper introduces Tree of Thoughts (ToT), a novel framework for language model inference that enhances problem-solving by enabling exploration, strategic decision-making, and backtracking over intermediate reasoning steps, significantly improving performance on complex tasks.
Contribution
It generalizes chain-of-thought prompting into a tree-based exploration framework, allowing deliberate decision-making and planning in language models for complex problem solving.
Findings
GPT-4 with ToT solved 74% of Game of 24 tasks versus 4% with chain-of-thought.
ToT improved problem-solving on tasks requiring planning and search.
Code and prompts are publicly available.
Abstract
Language models are increasingly being deployed for general problem solving across a wide range of tasks, but are still confined to token-level, left-to-right decision-making processes during inference. This means they can fall short in tasks that require exploration, strategic lookahead, or where initial decisions play a pivotal role. To surmount these challenges, we introduce a new framework for language model inference, Tree of Thoughts (ToT), which generalizes over the popular Chain of Thought approach to prompting language models, and enables exploration over coherent units of text (thoughts) that serve as intermediate steps toward problem solving. ToT allows LMs to perform deliberate decision making by considering multiple different reasoning paths and self-evaluating choices to decide the next course of action, as well as looking ahead or backtracking when necessary to make…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Residual Connection · Dense Connections · Dropout · Byte Pair Encoding · Softmax · Layer Normalization
