Learning to Select Goals in Automated Planning with Deep-Q Learning

Carlos Núñez-Molina; Juan Fernández-Olivares; Raúl Pérez

arXiv:2406.14779·cs.AI·June 24, 2024


TL;DR

This paper introduces a goal selection module for automated planning, based on deep reinforcement learning, that improves efficiency and generalization in real-time scenarios compared with a classical planner and standard Deep Q-Learning.

Contribution

It presents a novel architecture integrating Deep Q-Learning for subgoal selection, enhancing planning efficiency and generalization in complex environments.

Findings

01. Performs better than a state-of-the-art classical planner when plan quality (plan length) and time requirements are considered together.

02. More sample-efficient than standard Deep Q-Learning.

03. Generalizes better across different game levels.

Abstract

In this work we propose a planning and acting architecture endowed with a module that learns to select subgoals with Deep Q-Learning. This allows us to decrease the load of a planner when faced with scenarios with real-time restrictions. We have trained this architecture on a video game environment used as a standard test-bed for intelligent systems applications, testing it on different levels of the same game to evaluate its generalization abilities. We have measured the performance of our approach as more training data is made available, and compared it with both a state-of-the-art classical planner and the standard Deep Q-Learning algorithm. The results obtained show our model performs better than the alternative methods considered, when both plan quality (plan length) and time requirements are taken into account. On the one hand, it is more sample-efficient than standard…
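The subgoal-selection idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it swaps the deep network for a plain dictionary (tabular Q-learning) so the example stays self-contained, and the names `select_subgoal`, `q_update`, the state and subgoal labels, and the reward, `alpha`, and `gamma` values are all hypothetical choices made for illustration.

```python
import random

def select_subgoal(state, candidates, q_value, epsilon=0.0, rng=None):
    """Epsilon-greedy choice among candidate subgoals.

    q_value(state, subgoal) is any learned scoring function; in the paper's
    setting it would be a deep network, here it is an assumption-free callable.
    """
    if not candidates:
        raise ValueError("no candidate subgoals")
    rng = rng or random
    if rng.random() < epsilon:
        # Explore: pick a random subgoal.
        return rng.choice(list(candidates))
    # Exploit: pick the subgoal with the highest estimated value.
    return max(candidates, key=lambda g: q_value(state, g))

def q_update(q_table, state, subgoal, reward, next_state, next_candidates,
             alpha=0.5, gamma=0.9):
    """One standard Q-learning update on a dictionary (tabular stand-in)."""
    best_next = max(
        (q_table.get((next_state, g), 0.0) for g in next_candidates),
        default=0.0,  # terminal state: no further subgoals
    )
    old = q_table.get((state, subgoal), 0.0)
    q_table[(state, subgoal)] = old + alpha * (reward + gamma * best_next - old)
    return q_table[(state, subgoal)]

# Hypothetical usage: reaching "key" from state "s0" ended the episode with reward 1.0.
q_table = {}
updated = q_update(q_table, "s0", "key", 1.0, "s1", [])
chosen = select_subgoal("s0", ["door", "key"],
                        lambda s, g: q_table.get((s, g), 0.0))
```

In an architecture of the kind the abstract describes, the chosen subgoal would then be handed to a planner, which only has to find a plan to that subgoal instead of the final goal; the planner itself is outside the scope of this sketch.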


Taxonomy

Methods: Q-Learning