Large Language Models as Planning Domain Generators

James Oswald; Kavitha Srinivas; Harsha Kokel; Junkyu Lee; Michael; Katz; Shirin Sohrabi

arXiv:2405.06650·cs.CL·May 14, 2024·1 cites

Large Language Models as Planning Domain Generators

James Oswald, Kavitha Srinivas, Harsha Kokel, Junkyu Lee, Michael, Katz, Shirin Sohrabi

PDF

Open Access 1 Repo

TL;DR

This paper explores using large language models to automatically generate planning domain models from natural language descriptions, aiming to reduce manual effort in AI planning.

Contribution

It introduces a framework for evaluating LLM-generated planning domains and empirically analyzes multiple models across various domains and descriptions.

Findings

01

High-parameter LLMs show moderate proficiency in domain generation.

02

Evaluation framework compares plan sets for generated domains.

03

Models perform better with detailed natural language descriptions.

Abstract

Developing domain models is one of the few remaining places that require manual human labor in AI planning. Thus, in order to make planning more accessible, it is desirable to automate the process of domain model generation. To this end, we investigate if large language models (LLMs) can be used to generate planning domain models from simple textual descriptions. Specifically, we introduce a framework for automated evaluation of LLM-generated domains by comparing the sets of plans for domain instances. Finally, we perform an empirical analysis of 7 large language models, including coding and chat models across 9 different planning domains, and under three classes of natural language domain descriptions. Our results indicate that LLMs, particularly those with high parameter counts, exhibit a moderate level of proficiency in generating correct planning domains from natural language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IBM/NL2PDDL
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling