Integrating Reinforcement Learning, Action Model Learning, and Numeric   Planning for Tackling Complex Tasks

Yarin Benyamin; Argaman Mordoch; Shahaf S. Shperberg; Roni Stern

arXiv:2502.13006·cs.AI·February 19, 2025

Integrating Reinforcement Learning, Action Model Learning, and Numeric Planning for Tackling Complex Tasks

Yarin Benyamin, Argaman Mordoch, Shahaf S. Shperberg, Roni Stern

PDF

Open Access 1 Repo

TL;DR

This paper explores combining reinforcement learning, action model learning, and numeric planning to solve complex tasks in environments like Minecraft, demonstrating advantages over purely model-free approaches.

Contribution

It introduces a hybrid approach that learns numeric domain models and integrates them with reinforcement learning, improving long-term planning and generalization capabilities.

Findings

01

NSAM_(+p) enables solving long-term planning tasks in larger environments.

02

RAMP improves planning efficiency and problem-solving success over RL baselines.

03

Hybrid methods outperform model-free algorithms in complex, numeric environments.

Abstract

Automated Planning algorithms require a model of the domain that specifies the preconditions and effects of each action. Obtaining such a domain model is notoriously hard. Algorithms for learning domain models exist, yet it remains unclear whether learning a domain model and planning is an effective approach for numeric planning environments, i.e., where states include discrete and numeric state variables. In this work, we explore the benefits of learning a numeric domain model and compare it with alternative model-free solutions. As a case study, we use two tasks in Minecraft, a popular sandbox game that has been used as an AI challenge. First, we consider an offline learning setting, where a set of expert trajectories are available to learn from. This is the standard setting for learning domain models. We used the Numeric Safe Action Model Learning (NSAM) algorithm to learn a numeric…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

spl-bgu/polyplan
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsSparse Evolutionary Training