Sample Complexity of Learning Heuristic Functions for Greedy-Best-First and A* Search
Shinsaku Sakaue, Taihei Oki

TL;DR
This paper analyzes the sample complexity of learning heuristic functions for greedy best-first search and A* algorithms, providing bounds on the pseudo-dimension that influence the amount of data needed for effective learning.
Contribution
It introduces upper and lower bounds on the pseudo-dimension for learned heuristics in GBFS and A*, improving understanding of their sample complexity.
Findings
Upper bounds of O(n log n) for GBFS and O(n^2 log n) for A* on pseudo-dimension.
Lower bounds of Ω(n) indicating bounds are tight up to a log factor.
Improved bounds for A* when vertex degree or edge weights are bounded.
Abstract
Greedy best-first search (GBFS) and A* search (A*) are popular algorithms for path-finding on large graphs. Both use so-called heuristic functions, which estimate how close a vertex is to the goal. While heuristic functions have been handcrafted using domain knowledge, recent studies demonstrate that learning heuristic functions from data is effective in many applications. Motivated by this emerging approach, we study the sample complexity of learning heuristic functions for GBFS and A*. We build on a recent framework called \textit{data-driven algorithm design} and evaluate the \textit{pseudo-dimension} of a class of utility functions that measure the performance of parameterized algorithms. Assuming that a vertex set of size is fixed, we present and upper bounds on the pseudo-dimensions for GBFS and A*, respectively, parameterized by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsData Management and Algorithms · Machine Learning and Algorithms · Vehicle Routing Optimization Methods
