Generating Symbolic World Models via Test-time Scaling of Large Language Models
Zhouliang Yu, Yuhuan Yuan, Tim Z. Xiao, Fuxiang Frank Xia, Jie Fu, Ge, Zhang, Ge Lin, Weiyang Liu

TL;DR
This paper introduces a test-time scaling method for large language models to generate high-quality PDDL domain descriptions from natural language, enabling classical planning algorithms to solve complex tasks without additional training.
Contribution
The authors propose a novel test-time scaling approach that enhances LLMs' PDDL reasoning, allowing effective symbolic world model generation from natural language without extra training data.
Findings
Achieves over 50% success rate in generating PDDL domains from natural language.
Outperforms state-of-the-art methods on most planning benchmarks.
Does not require additional training data or fine-tuning.
Abstract
Solving complex planning problems requires Large Language Models (LLMs) to explicitly model the state transition to avoid rule violations, comply with constraints, and ensure optimality-a task hindered by the inherent ambiguity of natural language. To overcome such ambiguity, Planning Domain Definition Language (PDDL) is leveraged as a planning abstraction that enables precise and formal state descriptions. With PDDL, we can generate a symbolic world model where classic searching algorithms, such as A*, can be seamlessly applied to find optimal plans. However, directly generating PDDL domains with current LLMs remains an open challenge due to the lack of PDDL training data. To address this challenge, we propose to scale up the test-time computation of LLMs to enhance their PDDL reasoning capabilities, thereby enabling the generation of high-quality PDDL domains. Specifically, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
