Towards Learning Foundation Models for Heuristic Functions to Solve   Pathfinding Problems

Vedant Khandelwal; Amit Sheth; Forest Agostinelli

arXiv:2406.02598·cs.LG·June 6, 2024

Towards Learning Foundation Models for Heuristic Functions to Solve Pathfinding Problems

Vedant Khandelwal, Amit Sheth, Forest Agostinelli

PDF

Open Access

TL;DR

This paper presents a foundation model using deep reinforcement learning to generate adaptable heuristic functions for pathfinding, enabling efficient solving of diverse and unseen domains without additional training.

Contribution

The study introduces a novel foundation model that leverages deep reinforcement learning to produce domain-adaptive heuristic functions for pathfinding problems.

Findings

01

Strong correlation between learned and true heuristics across domains

02

Model generalizes well to unseen pathfinding problems

03

Improved adaptability over traditional domain-specific methods

Abstract

Pathfinding problems are found throughout robotics, computational science, and natural sciences. Traditional methods to solve these require training deep neural networks (DNNs) for each new problem domain, consuming substantial time and resources. This study introduces a novel foundation model, leveraging deep reinforcement learning to train heuristic functions that seamlessly adapt to new domains without further fine-tuning. Building upon DeepCubeA, we enhance the model by providing the heuristic function with the domain's state transition information, improving its adaptability. Utilizing a puzzle generator for the 15-puzzle action space variation domains, we demonstrate our model's ability to generalize and solve unseen domains. We achieve a strong correlation between learned and ground truth heuristic values across various domains, as evidenced by robust R-squared and Concordance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Model-Driven Software Engineering Techniques