Self-Improvement for Fast, High-Quality Plan Generation

Robert Gieselmann; Henrike von Huelsen; Mihai Samson; Marie-Christine Meyer; Dariusz Piotrowski; Oleksandr Radomskyi; Justin Okamoto; Turan Gojayev; Michael Painter; Gavin Brown; Federico Pecora; Jeremy L. Wyatt

arXiv:2605.03625·cs.AI·May 6, 2026

Self-Improvement for Fast, High-Quality Plan Generation

Robert Gieselmann, Henrike von Huelsen, Mihai Samson, Marie-Christine Meyer, Dariusz Piotrowski, Oleksandr Radomskyi, Justin Okamoto, Turan Gojayev, Michael Painter, Gavin Brown, Federico Pecora, Jeremy L. Wyatt

PDF

TL;DR

This paper presents a method using self-improving generative models to produce high-quality plans efficiently, outperforming traditional symbolic planners in plan length and scalability across multiple domains.

Contribution

It introduces a self-improvement framework for transformer-based models that iteratively enhances plan quality through combined model and graph search techniques.

Findings

01

30% reduction in plan length compared to symbolic planners

02

Over 80% of generated plans are optimal

03

Model latency scales sub-exponentially

Abstract

Generative models trained on synthetic plan data are a promising approach to generalized planning. Recent work has focused on finding any valid plan, rather than a high-quality solution. We address the challenge of producing high-quality plans, a computationally hard problem, in sub-exponential time. First, we demonstrate that, given optimal data, a decoder-only transformer can generate high-quality plans for unseen problem instances. Second, we show how to self-improve an initial model trained on sub-optimal data. Each round of self-improvement combines multiple model calls with graph search to generate improved plans, used for model fine-tuning. An experimental study on four domains: Blocksworld, Logistics, Labyrinth, and Sokoban, shows on average a 30% reduction in plan length over the source symbolic planner, with over 80% of plans being optimal, where the optimum is known. Plan…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.