Agentic LLM Planning via Step-Wise PDDL Simulation: An Empirical Characterisation
Kai G\"obel, Pierrick Lorang, Patrik Zips, and Tobias Gl\"uck

TL;DR
This paper investigates the use of large language models as interactive, agentic planners in task planning, comparing their performance to classical methods using a new PDDL simulation engine and highlighting the importance of environmental feedback.
Contribution
Introduces PyPDDLEngine, an open-source PDDL simulation engine enabling LLMs to perform step-wise, agentic planning with interactive feedback, bridging symbolic planning and language models.
Findings
Agentic LLM planning achieves 66.7% success rate.
Classical planners outperform LLMs in success rate.
LLMs produce shorter plans than classical methods.
Abstract
Task planning, the problem of sequencing actions to reach a goal from an initial state, is a core capability requirement for autonomous robotic systems. Whether large language models (LLMs) can serve as viable planners alongside classical symbolic methods remains an open question. We present PyPDDLEngine, an open-source Planning Domain Definition Language (PDDL) simulation engine that exposes planning operations as LLM tool calls through a Model Context Protocol (MCP) interface. Rather than committing to a complete action sequence upfront, the LLM acts as an interactive search policy that selects one action at a time, observes each resulting state, and can reset and retry. We evaluate four approaches on 102 International Planning Competition (IPC) Blocksworld instances under a uniform 180-second budget: Fast Downward lama-first and seq-sat-lama-2011 as classical baselines, direct LLM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Multimodal Machine Learning Applications · Robotic Path Planning Algorithms
