Safe Learning of Lifted Action Models

Brendan Juba; Hai S. Le; Roni Stern

arXiv:2107.04169·cs.AI·July 12, 2021

Safe Learning of Lifted Action Models

Brendan Juba, Hai S. Le, Roni Stern

PDF

1 Repo

TL;DR

This paper introduces a safe, scalable algorithm for learning lifted action models in classical planning, ensuring reliable plan execution without prior domain knowledge, with proven correctness and efficient data requirements.

Contribution

It generalizes previous grounded domain learning methods to lifted domains, providing the first safe model-free planning algorithm with theoretical guarantees.

Findings

01

Successfully learned models in all tested IPC domains with at most two trajectories.

02

Proved the correctness of the approach and analyzed the sample complexity.

03

Demonstrated scalability and safety in offline learning scenarios.

Abstract

Creating a domain model, even for classical, domain-independent planning, is a notoriously hard knowledge-engineering task. A natural approach to solve this problem is to learn a domain model from observations. However, model learning approaches frequently do not provide safety guarantees: the learned model may assume actions are applicable when they are not, and may incorrectly capture actions' effects. This may result in generating plans that will fail when executed. In some domains such failures are not acceptable, due to the cost of failure or inability to replan online after failure. In such settings, all learning must be done offline, based on some observations collected, e.g., by some other agents or a human. Through this learning, the task is to generate a plan that is guaranteed to be successful. This is called the model-free planning problem. Prior work proposed an algorithm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hsle/sam-learning
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.