Agentic LLM Planning via Step-Wise PDDL Simulation: An Empirical Characterisation

Kai G\"obel; Pierrick Lorang; Patrik Zips; and Tobias Gl\"uck

arXiv:2603.06064·cs.AI·March 9, 2026

Agentic LLM Planning via Step-Wise PDDL Simulation: An Empirical Characterisation

Kai G\"obel, Pierrick Lorang, Patrik Zips, and Tobias Gl\"uck

PDF

Open Access

TL;DR

This paper investigates the use of large language models as interactive, agentic planners in task planning, comparing their performance to classical methods using a new PDDL simulation engine and highlighting the importance of environmental feedback.

Contribution

Introduces PyPDDLEngine, an open-source PDDL simulation engine enabling LLMs to perform step-wise, agentic planning with interactive feedback, bridging symbolic planning and language models.

Findings

01

Agentic LLM planning achieves 66.7% success rate.

02

Classical planners outperform LLMs in success rate.

03

LLMs produce shorter plans than classical methods.

Abstract

Task planning, the problem of sequencing actions to reach a goal from an initial state, is a core capability requirement for autonomous robotic systems. Whether large language models (LLMs) can serve as viable planners alongside classical symbolic methods remains an open question. We present PyPDDLEngine, an open-source Planning Domain Definition Language (PDDL) simulation engine that exposes planning operations as LLM tool calls through a Model Context Protocol (MCP) interface. Rather than committing to a complete action sequence upfront, the LLM acts as an interactive search policy that selects one action at a time, observes each resulting state, and can reset and retry. We evaluate four approaches on 102 International Planning Competition (IPC) Blocksworld instances under a uniform 180-second budget: Fast Downward lama-first and seq-sat-lama-2011 as classical baselines, direct LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Multimodal Machine Learning Applications · Robotic Path Planning Algorithms