Learning Neural Search Policies for Classical Planning

Pawel Gomoluch; Dalal Alrajeh; Alessandra Russo; Antonio Bucchiarone

arXiv:1911.12200·cs.AI·November 28, 2019

Learning Neural Search Policies for Classical Planning

Pawel Gomoluch, Dalal Alrajeh, Alessandra Russo, Antonio Bucchiarone

PDF

TL;DR

This paper introduces a neural approach to dynamically adapt search algorithms in classical planning, enabling more effective problem-solving by learning policies that modify search parameters during execution.

Contribution

It proposes a parametrized search algorithm template combined with neural policies to adapt search strategies in classical planning, surpassing fixed or handcrafted methods.

Findings

01

Neural policies effectively adapt search parameters during planning.

02

The approach outperforms baseline methods on distribution-specific problems.

03

The method learns to optimize planner performance through the cross-entropy training.

Abstract

Heuristic forward search is currently the dominant paradigm in classical planning. Forward search algorithms typically rely on a single, relatively simple variation of best-first search and remain fixed throughout the process of solving a planning problem. Existing work combining multiple search techniques usually aims at supporting best-first search with an additional exploratory mechanism, triggered using a handcrafted criterion. A notable exception is very recent work which combines various search techniques using a trainable policy. It is, however, confined to a discrete action space comprising several fixed subroutines. In this paper, we introduce a parametrized search algorithm template which combines various search techniques within a single routine. The template's parameter space defines an infinite space of search algorithms, including, among others, BFS, local and random…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.