First-Order Representation Languages for Goal-Conditioned RL
Simon St{\aa}hlberg, Hector Geffner

TL;DR
This paper explores how first-order languages and goal relabeling techniques like HER can improve goal-conditioned reinforcement learning, enabling efficient learning of general policies in large, complex planning tasks with sparse rewards.
Contribution
It introduces the use of set-based goal representations and lifted subgoals to enhance HER, facilitating curriculum learning and better generalization in large-scale goal-conditioned RL.
Findings
Set-based goal representations improve learning efficiency.
Lifted subgoals enable curriculum learning with sparse rewards.
The proposed methods demonstrate computational gains on large planning problems.
Abstract
First-order relational languages have been used in MDP planning and reinforcement learning (RL) for two main purposes: specifying MDPs in compact form, and representing and learning policies that are general and not tied to specific instances or state spaces. In this work, we instead consider the use of first-order languages in goal-conditioned RL and generalized planning. The question is how to learn goal-conditioned and general policies when the training instances are large and the goal cannot be reached by random exploration alone. The technique of Hindsight Experience Replay (HER) provides an answer to this question: it relabels unsuccessful trajectories as successful ones by replacing the original goal with one that was actually achieved. If the target policy must generalize across states and goals, trajectories that do not reach the original goal states can enable more data- and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Reinforcement Learning in Robotics · Artificial Intelligence in Games
