TransPath: Learning Heuristics For Grid-Based Pathfinding via Transformers
Daniil Kirilenko, Anton Andreychuk, Aleksandr Panov, Konstantin, Yakovlev

TL;DR
This paper introduces TransPath, a transformer-based approach to learn instance-dependent heuristics for grid pathfinding, significantly improving search efficiency while maintaining near-optimal solutions.
Contribution
It proposes novel heuristics learned via transformers, including a correction factor and path probability, to enhance A* search in obstacle-rich environments.
Findings
Up to 4x reduction in A* computational effort
Solutions within 0.3% of optimal cost on average
Outperforms traditional and state-of-the-art learned planners
Abstract
Heuristic search algorithms, e.g. A*, are the commonly used tools for pathfinding on grids, i.e. graphs of regular structure that are widely employed to represent environments in robotics, video games etc. Instance-independent heuristics for grid graphs, e.g. Manhattan distance, do not take the obstacles into account and, thus, the search led by such heuristics performs poorly in the obstacle-rich environments. To this end, we suggest learning the instance-dependent heuristic proxies that are supposed to notably increase the efficiency of the search. The first heuristic proxy we suggest to learn is the correction factor, i.e. the ratio between the instance independent cost-to-go estimate and the perfect one (computed offline at the training phase). Unlike learning the absolute values of the cost-to-go heuristic function, which was known before, when learning the correction factor the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsRobotic Path Planning Algorithms · Artificial Intelligence in Games
