Planning of Heuristics: Strategic Planning on Large Language Models with Monte Carlo Tree Search for Automating Heuristic Optimization
Hui Wang, Xufeng Zhang, Chaoxu Mu

TL;DR
This paper introduces PoH, a novel method combining LLMs and Monte Carlo Tree Search to automatically optimize heuristics for combinatorial problems, outperforming existing methods especially on large instances.
Contribution
The paper presents a new approach that integrates LLM self-reflection with MCTS to iteratively refine heuristics for combinatorial optimization problems.
Findings
PoH outperforms hand-crafted heuristics.
PoH achieves state-of-the-art results on tested COPs.
Effective for large problem instances.
Abstract
Heuristics have achieved great success in solving combinatorial optimization problems~(COPs). However, heuristics designed by humans require too much domain knowledge and testing time. Since Large Language Models~(LLMs) possess strong capabilities to understand and generate content with a knowledge base that covers various domains, they offer potential ways to automatically optimize heuristics. To this end, we propose Planning of Heuristics~(PoH), an optimization method that integrates LLM self-reflection with Monte Carlo Tree Search, a well-known planning algorithm. PoH iteratively refines generated heuristics by evaluating their performance and providing improvement suggestions. Our method enables to iteratively evaluate the generated heuristics~(states) and improve them based on the improvement suggestions~(actions) and evaluation results~(rewards), by effectively simulating future…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Techniques and Practices · Service-Oriented Architecture and Web Services · Edcuational Technology Systems
MethodsBalanced Selection
