Skill-based Model-based Reinforcement Learning
Lucy Xiaoyang Shi, Joseph J. Lim, Youngwoon Lee

TL;DR
SkiMo introduces a skill-based model-based RL framework that plans in skill space using a learned skill dynamics model, significantly enhancing long-horizon planning and sample efficiency in complex tasks.
Contribution
The paper proposes a novel skill-based planning approach in model-based RL, jointly learning a skill repertoire and dynamics model for improved long-term planning.
Findings
Extends the effective planning horizon of model-based RL.
Improves sample efficiency in long-horizon tasks.
Enhances performance in navigation and manipulation domains.
Abstract
Model-based reinforcement learning (RL) is a sample-efficient way of learning complex behaviors by leveraging a learned single-step dynamics model to plan actions in imagination. However, planning every action for long-horizon tasks is not practical, akin to a human planning out every muscle movement. Instead, humans efficiently plan with high-level skills to solve complex tasks. From this intuition, we propose a Skill-based Model-based RL framework (SkiMo) that enables planning in the skill space using a skill dynamics model, which directly predicts the skill outcomes, rather than predicting all small details in the intermediate states, step by step. For accurate and efficient long-term planning, we jointly learn the skill dynamics model and a skill repertoire from prior experience. We then harness the learned skill dynamics model to accurately simulate and plan over long horizons in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robotic Locomotion and Control · Educational Games and Gamification
