Mean-based Heuristic Search for Real-Time Planning

Damien Pellier; Bruno Bouzy; Marc M\'etivier

arXiv:1810.09150·cs.AI·October 23, 2018

Mean-based Heuristic Search for Real-Time Planning

Damien Pellier, Bruno Bouzy, Marc M\'etivier

PDF

Open Access

TL;DR

This paper presents MHSP, a novel mean-based heuristic search algorithm for real-time planning that combines bandit algorithms with heuristic search, demonstrating faster and higher-quality plans than existing methods.

Contribution

Introduction of MHSP, a new real-time planning algorithm combining UCT principles with heuristic search, showing improved speed and plan quality.

Findings

01

MHSP produces plans faster than existing algorithms.

02

Plans generated by MHSP tend to be closer to optimal over time.

03

MHSP outperforms in various planning problem evaluations.

Abstract

In this paper, we introduce a new heuristic search algorithm based on mean values for real-time planning, called MHSP. It consists in associating the principles of UCT, a bandit-based algorithm which gave very good results in computer games, and especially in Computer Go, with heuristic search in order to obtain a real-time planner in the context of classical planning. MHSP is evaluated on different planning problems and compared to existing algorithms performing on-line search and learning. Besides, our results highlight the capacity of MHSP to return plans in a real-time manner which tend to an optimal plan over the time which is faster and of better quality compared to existing algorithms in the literature.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · AI-based Problem Solving and Planning · Robotic Path Planning Algorithms