RAPid-Learn: A Framework for Learning to Recover for Handling Novelties in Open-World Environments
Shivam Goel, Yash Shukla, Vasanth Sarathy, Matthias Scheutz, Jivko, Sinapov

TL;DR
RAPid-Learn is a hybrid planning and learning framework that enables agents to adapt quickly to environmental novelties by updating their models and learning new action executors, demonstrated in a Minecraft-inspired gridworld.
Contribution
It introduces a novel hybrid method for on-the-fly adaptation to environmental changes, combining planning, learning, and domain knowledge exploitation.
Findings
Effective with multiple novelties
More sample efficient than transfer learning RL baselines
Robust to incomplete model information
Abstract
We propose RAPid-Learn: Learning to Recover and Plan Again, a hybrid planning and learning method, to tackle the problem of adapting to sudden and unexpected changes in an agent's environment (i.e., novelties). RAPid-Learn is designed to formulate and solve modifications to a task's Markov Decision Process (MDPs) on-the-fly and is capable of exploiting domain knowledge to learn any new dynamics caused by the environmental changes. It is capable of exploiting the domain knowledge to learn action executors which can be further used to resolve execution impasses, leading to a successful plan execution. This novelty information is reflected in its updated domain model. We demonstrate its efficacy by introducing a wide variety of novelties in a gridworld environment inspired by Minecraft, and compare our algorithm with transfer learning baselines from the literature. Our method is (1)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Bayesian Modeling and Causal Inference · Reinforcement Learning in Robotics
