Plan of Thoughts: Heuristic-Guided Problem Solving with Large Language   Models

Houjun Liu

arXiv:2404.19055·cs.CL·May 1, 2024

Plan of Thoughts: Heuristic-Guided Problem Solving with Large Language Models

Houjun Liu

PDF

Open Access

TL;DR

This paper introduces a planning-based method using POMDPs and heuristics for large language models to improve multi-step reasoning, achieving higher success rates and better performance on complex tasks.

Contribution

It formalizes a POMDP-based framework for multi-step problem solving with LMs and demonstrates its effectiveness with the POMCP solver on the Game of 24.

Findings

01

Achieved 89.4% success rate on the Game of 24.

02

Outperforms existing approaches in success rate and anytime performance.

03

Provides a formal planning framework for LMs in multi-step reasoning.

Abstract

While language models (LMs) offer significant capability in zero-shot reasoning tasks across a wide range of domains, they do not perform satisfactorily in problems which requires multi-step reasoning. Previous approaches to mitigate this involves breaking a larger, multi-step task into sub-tasks and asking the language model to generate proposals ("thoughts") for each sub-task and using exhaustive planning approaches such as DFS to compose a solution. In this work, we leverage this idea to introduce two new contributions: first, we formalize a planning-based approach to perform multi-step problem solving with LMs via Partially Observable Markov Decision Processes (POMDPs), with the LM's own reflections about the value of a state used as a search heuristic; second, leveraging the online POMDP solver POMCP, we demonstrate a superior success rate of 89.4% on the Game of 24 task as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Techniques and Practices · Topic Modeling · Natural Language Processing Techniques