Exploiting Symbolic Heuristics for the Synthesis of Domain-Specific Temporal Planning Guidance using Reinforcement Learning

Irene Brugnara; Alessandro Valentini; Andrea Micheli

arXiv:2505.13372·cs.AI·May 20, 2025

Exploiting Symbolic Heuristics for the Synthesis of Domain-Specific Temporal Planning Guidance using Reinforcement Learning

Irene Brugnara, Alessandro Valentini, Andrea Micheli

PDF

Open Access

TL;DR

This paper advances temporal planning by integrating symbolic heuristics with reinforcement learning, enabling more effective heuristic guidance through residual learning and combined search strategies, thereby improving planning performance.

Contribution

It introduces a novel framework that exploits symbolic heuristics during RL and planning, including residual learning and multiple-queue search, to enhance domain-specific temporal planning.

Findings

01

Residual heuristic learning improves guidance accuracy.

02

Combining symbolic and learned heuristics balances search efficiency.

03

Experimental results outperform previous approaches.

Abstract

Recent work investigated the use of Reinforcement Learning (RL) for the synthesis of heuristic guidance to improve the performance of temporal planners when a domain is fixed and a set of training problems (not plans) is given. The idea is to extract a heuristic from the value function of a particular (possibly infinite-state) MDP constructed over the training problems. In this paper, we propose an evolution of this learning and planning framework that focuses on exploiting the information provided by symbolic heuristics during both the RL and planning phases. First, we formalize different reward schemata for the synthesis and use symbolic heuristics to mitigate the problems caused by the truncation of episodes needed to deal with the potentially infinite MDP. Second, we propose learning a residual of an existing symbolic heuristic, which is a "correction" of the heuristic value,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms

MethodsSparse Evolutionary Training