PathWise: Planning through World Model for Automated Heuristic Design via Self-Evolving LLMs
Oguzhan Gungordu, Siheng Xiong, Faramarz Fekri

TL;DR
PathWise introduces a multi-agent framework that leverages a world model and reasoning to improve automated heuristic design for combinatorial optimization, enabling faster convergence and better generalization.
Contribution
It presents a novel planning-based approach using self-evolving LLMs with a world model, moving beyond trial-and-error to state-aware heuristic generation.
Findings
Faster convergence to high-quality heuristics
Generalizes across different LLMs and problem sizes
Scales effectively to larger combinatorial problems
Abstract
Large Language Models (LLMs) have enabled automated heuristic design (AHD) for combinatorial optimization problems (COPs), but existing frameworks' reliance on fixed evolutionary rules and static prompt templates often leads to myopic heuristic generation, redundant evaluations, and limited reasoning about how new heuristics should be derived. We propose a novel multi-agent reasoning framework, referred to as Planning through World Model for Automated Heuristic Design via Self-Evolving LLMs (PathWise), which formulates heuristic generation as a sequential decision process over an entailment graph serving as a compact, stateful memory of the search trajectory. This approach allows the system to carry forward past decisions and reuse or avoid derivation information across generations. A policy agent plans evolutionary actions, a world model agent generates heuristic rollouts conditioned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConstraint Satisfaction and Optimization · AI-based Problem Solving and Planning · Advanced Multi-Objective Optimization Algorithms
