Language Model Planners do not Scale, but do Formalizers?

Owen Jiang; Cassie Huang; Ashish Sabharwal; Li Zhang

arXiv:2603.23844·cs.CL·March 26, 2026

Language Model Planners do not Scale, but do Formalizers?

Owen Jiang, Cassie Huang, Ashish Sabharwal, Li Zhang

PDF

Open Access

TL;DR

While LLM planners struggle with complex problems, LLM formalizers significantly outperform them, especially with divide-and-conquer strategies and higher-order formalization to handle exponential complexity.

Contribution

The paper demonstrates that LLM formalizers outperform planners in complex domains, introduces a divide-and-conquer formalization technique, and proposes a new paradigm using higher-order formalizers to manage exponential problem complexity.

Findings

01

LLM formalizers retain perfect accuracy in BlocksWorld with large state spaces.

02

Divide-and-conquer formalization improves robustness against problem complexity.

03

Higher-order formalizers effectively handle exponential formalization challenges.

Abstract

Recent work shows overwhelming evidence that LLMs, even those trained to scale their reasoning trace, perform unsatisfactorily when solving planning problems too complex. Whether the same conclusion holds for LLM formalizers that generate solver-oriented programs remains unknown. We systematically show that LLM formalizers greatly out-scale LLM planners, some retaining perfect accuracy in the classic BlocksWorld domain with a huge state space of size up to $1 0^{165}$ . While performance of smaller LLM formalizers degrades with problem complexity, we show that a divide-and-conquer formalizing technique can greatly improve its robustness. Finally, we introduce unraveling problems where one line of problem description realistically corresponds to exponentially many lines of formal language such as the Planning Domain Definition Language (PDDL), greatly challenging LLM formalizers. We tackle…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Logic, Reasoning, and Knowledge · Constraint Satisfaction and Optimization