Large Language Models as Commonsense Knowledge for Large-Scale Task Planning
Zirui Zhao, Wee Sun Lee, David Hsu

TL;DR
This paper introduces LLM-MCTS, a novel approach that combines large language models as a world model and policy to enhance large-scale task planning, significantly outperforming previous methods on complex tasks.
Contribution
The paper presents a new method integrating LLMs with Monte Carlo Tree Search, leveraging LLMs as both a world model and policy to improve planning efficiency and effectiveness.
Findings
LLM-MCTS outperforms standalone MCTS and LLM-induced policies on complex tasks.
Using LLMs as a world model provides a strong prior for effective reasoning.
Minimum description length guides when LLM-based world models are preferable.
Abstract
Large-scale task planning is a major challenge. Recent work exploits large language models (LLMs) directly as a policy and shows surprisingly interesting results. This paper shows that LLMs provide a commonsense model of the world in addition to a policy that acts on it. The world model and the policy can be combined in a search algorithm, such as Monte Carlo Tree Search (MCTS), to scale up task planning. In our new LLM-MCTS algorithm, the LLM-induced world model provides a commonsense prior belief for MCTS to achieve effective reasoning; the LLM-induced policy acts as a heuristic to guide the search, vastly improving search efficiency. Experiments show that LLM-MCTS outperforms both MCTS alone and policies induced by LLMs (GPT2 and GPT3.5) by a wide margin, for complex, novel tasks. Further experiments and analyses on multiple tasks -- multiplication, multi-hop travel planning, object…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
