Leveraging Pre-trained Large Language Models to Construct and Utilize   World Models for Model-based Task Planning

Lin Guan; Karthik Valmeekam; Sarath Sreedharan; Subbarao Kambhampati

arXiv:2305.14909·cs.AI·November 3, 2023·34 cites

Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning

Lin Guan, Karthik Valmeekam, Sarath Sreedharan, Subbarao Kambhampati

PDF

Open Access 1 Video

TL;DR

This paper proposes a method that uses large language models to create and refine explicit world models in PDDL, enabling sound planning with external domain-independent planners, reducing human effort and improving plan correctness.

Contribution

It introduces a novel framework that constructs and corrects PDDL domain models using LLMs as interfaces, enhancing planning accuracy and reducing human involvement.

Findings

01

GPT-4 can generate high-quality PDDL models for over 40 actions.

02

Corrected PDDL models successfully solve 48 challenging planning tasks.

03

The approach outperforms previous methods in complex domains.

Abstract

There is a growing interest in applying pre-trained large language models (LLMs) to planning problems. However, methods that use LLMs directly as planners are currently impractical due to several factors, including limited correctness of plans, strong reliance on feedback from interactions with simulators or even the actual environment, and the inefficiency in utilizing human feedback. In this work, we introduce a novel alternative paradigm that constructs an explicit world (domain) model in planning domain definition language (PDDL) and then uses it to plan with sound domain-independent planners. To address the fact that LLMs may not generate a fully functional PDDL model initially, we employ LLMs as an interface between PDDL and sources of corrective feedback, such as PDDL validators and humans. For users who lack a background in PDDL, we show that LLMs can translate PDDL into natural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Dropout · Byte Pair Encoding · Softmax · Layer Normalization · Position-Wise Feed-Forward Layer · Linear Layer · Absolute Position Encodings