Learning Temporally Extended Skills in Continuous Domains as Symbolic Actions for Planning
Jan Achterhold, Markus Krimmel, Joerg Stueckler

TL;DR
This paper introduces SEADS, a hierarchical reinforcement learning agent that learns diverse skills and a symbolic forward model to enable long-horizon planning and continuous control in complex tasks, demonstrated on a real robot.
Contribution
The paper presents a novel approach combining unsupervised skill learning with symbolic planning using a forward model, enabling effective long-horizon control in continuous domains.
Findings
SEADS outperforms baseline agents in complex tasks.
Successfully demonstrated on a real robot.
Learns diverse skills and symbolic models unsupervised.
Abstract
Problems which require both long-horizon planning and continuous control capabilities pose significant challenges to existing reinforcement learning agents. In this paper we introduce a novel hierarchical reinforcement learning agent which links temporally extended skills for continuous control with a forward model in a symbolic discrete abstraction of the environment's state for planning. We term our agent SEADS for Symbolic Effect-Aware Diverse Skills. We formulate an objective and corresponding algorithm which leads to unsupervised learning of a diverse set of skills through intrinsic motivation given a known state abstraction. The skills are jointly learned with the symbolic forward model which captures the effect of skill execution in the state abstraction. After training, we can leverage the skills as symbolic actions using the forward model for long-horizon planning and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Evolutionary Algorithms and Applications
