Exploiting Learned Policies in Focal Search
Pablo Araneda, Matias Greco, Jorge A. Baier

TL;DR
This paper explores integrating neural network policies into Focal Search to improve bounded-suboptimal search efficiency, providing mathematical foundations and empirical evaluations across benchmarks and the 15-puzzle.
Contribution
It introduces novel methods for embedding learned policies into Focal Search with theoretical support and evaluates their performance on synthetic and real-world problems.
Findings
Discrepancy Focal Search outperforms other variants in runtime and solution quality.
The approach effectively leverages neural policies with varying accuracy levels.
Mathematical foundations support the proposed algorithms.
Abstract
Recent machine-learning approaches to deterministic search and domain-independent planning employ policy learning to speed up search. Unfortunately, when attempting to solve a search problem by successively applying a policy, no guarantees can be given on solution quality. The problem of how to effectively use a learned policy within a bounded-suboptimal search algorithm remains largely as an open question. In this paper, we propose various ways in which such policies can be integrated into Focal Search, assuming that the policy is a neural network classifier. Furthermore, we provide mathematical foundations for some of the resulting algorithms. To evaluate the resulting algorithms over a number of policies with varying accuracy, we use synthetic policies which can be generated for a target accuracy for problems where the search space can be held in memory. We evaluate our focal search…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · AI-based Problem Solving and Planning · Reservoir Engineering and Simulation Methods
