TL;DR
This paper introduces a safe, scalable algorithm for learning lifted action models in classical planning, ensuring reliable plan execution without prior domain knowledge, with proven correctness and efficient data requirements.
Contribution
It generalizes previous grounded domain learning methods to lifted domains, providing the first safe model-free planning algorithm with theoretical guarantees.
Findings
Successfully learned models in all tested IPC domains with at most two trajectories.
Proved the correctness of the approach and analyzed the sample complexity.
Demonstrated scalability and safety in offline learning scenarios.
Abstract
Creating a domain model, even for classical, domain-independent planning, is a notoriously hard knowledge-engineering task. A natural approach to solve this problem is to learn a domain model from observations. However, model learning approaches frequently do not provide safety guarantees: the learned model may assume actions are applicable when they are not, and may incorrectly capture actions' effects. This may result in generating plans that will fail when executed. In some domains such failures are not acceptable, due to the cost of failure or inability to replan online after failure. In such settings, all learning must be done offline, based on some observations collected, e.g., by some other agents or a human. Through this learning, the task is to generate a plan that is guaranteed to be successful. This is called the model-free planning problem. Prior work proposed an algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
