Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning

Philippe Hansen-Estruch; Amy Zhang; Ashvin Nair; Patrick Yin; Sergey; Levine

arXiv:2204.13060·cs.LG·May 18, 2022·5 cites

Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning

Philippe Hansen-Estruch, Amy Zhang, Ashvin Nair, Patrick Yin, Sergey, Levine

PDF

Open Access

TL;DR

This paper introduces goal-conditioned bisimulation, a novel state abstraction method that enables goal generalization in reinforcement learning by inferring goals from analogous tasks, demonstrated through simulation manipulation tasks.

Contribution

It proposes a new form of state abstraction called goal-conditioned bisimulation, and develops a metric learning approach to generalize to new goals in RL tasks.

Findings

01

Successfully generalizes to new goals in simulation tasks

02

Representation is sufficient for downstream state-only reward tasks

03

Achieves skill reuse through functional equivariance

Abstract

Building generalizable goal-conditioned agents from rich observations is a key to reinforcement learning (RL) solving real world problems. Traditionally in goal-conditioned RL, an agent is provided with the exact goal they intend to reach. However, it is often not realistic to know the configuration of the goal before performing a task. A more scalable framework would allow us to provide the agent with an example of an analogous task, and have the agent then infer what the goal should be for its current state. We propose a new form of state abstraction called goal-conditioned bisimulation that captures functional equivariance, allowing for the reuse of skills to achieve new goals. We learn this representation using a metric form of this abstraction, and show its ability to generalize to new goals in simulation manipulation tasks. Further, we prove that this learned representation is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning