Learning Neural Search Policies for Classical Planning
Pawel Gomoluch, Dalal Alrajeh, Alessandra Russo, Antonio Bucchiarone

TL;DR
This paper introduces a neural approach to dynamically adapt search algorithms in classical planning, enabling more effective problem-solving by learning policies that modify search parameters during execution.
Contribution
It proposes a parametrized search algorithm template combined with neural policies to adapt search strategies in classical planning, surpassing fixed or handcrafted methods.
Findings
Neural policies effectively adapt search parameters during planning.
The approach outperforms baseline methods on distribution-specific problems.
The method learns to optimize planner performance through the cross-entropy training.
Abstract
Heuristic forward search is currently the dominant paradigm in classical planning. Forward search algorithms typically rely on a single, relatively simple variation of best-first search and remain fixed throughout the process of solving a planning problem. Existing work combining multiple search techniques usually aims at supporting best-first search with an additional exploratory mechanism, triggered using a handcrafted criterion. A notable exception is very recent work which combines various search techniques using a trainable policy. It is, however, confined to a discrete action space comprising several fixed subroutines. In this paper, we introduce a parametrized search algorithm template which combines various search techniques within a single routine. The template's parameter space defines an infinite space of search algorithms, including, among others, BFS, local and random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
