Open Grounded Planning: Challenges and Benchmark Construction

Shiguang Guo; Ziliang Deng; Hongyu Lin; Yaojie Lu; Xianpei Han; Le Sun

arXiv:2406.02903·cs.CL·June 6, 2024

Open Grounded Planning: Challenges and Benchmark Construction

Shiguang Guo, Ziliang Deng, Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces open grounded planning, a new task for generating executable plans based on variable action sets, and constructs a benchmark to evaluate LLMs and planning methods across diverse domains.

Contribution

It defines the open grounded planning task, creates a comprehensive benchmark dataset, and evaluates current LLMs, highlighting existing challenges and future research directions.

Findings

01

Current LLMs struggle with open grounded planning tasks.

02

Existing methods are insufficient for generating executable plans in open domains.

03

The benchmark reveals significant gaps in LLM capabilities for grounded planning.

Abstract

The emergence of large language models (LLMs) has increasingly drawn attention to the use of LLMs for human-like planning. Existing work on LLM-based planning either focuses on leveraging the inherent language generation capabilities of LLMs to produce free-style plans, or employs reinforcement learning approaches to learn decision-making for a limited set of actions within restricted environments. However, both approaches exhibit significant discrepancies from the open and executable requirements in real-world planning. In this paper, we propose a new planning task--open grounded planning. The primary objective of open grounded planning is to ask the model to generate an executable plan based on a variable action set, thereby ensuring the executability of the produced plan. To this end, we establishes a benchmark for open grounded planning spanning a wide range of domains. Then we test…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shiguang-guo/open-grounded-planning
none

Videos

Open Grounded Planning: Challenges and Benchmark Construction· underline

Taxonomy

TopicsEducation Systems and Policy

MethodsSparse Evolutionary Training