Equivariant Goal Conditioned Contrastive Reinforcement Learning

Arsh Tangri; Nichols Crawford Taylor; Haojie Huang; Robert Platt

arXiv:2507.16139·cs.RO·July 23, 2025

Equivariant Goal Conditioned Contrastive Reinforcement Learning

Arsh Tangri, Nichols Crawford Taylor, Haojie Huang, Robert Platt

PDF

Open Access

TL;DR

This paper introduces Equivariant CRL, a method that incorporates symmetry constraints into contrastive reinforcement learning to improve sample efficiency and generalization in goal-conditioned robotic manipulation tasks.

Contribution

The paper proposes a novel equivariant CRL framework that leverages task symmetries, formalizes goal-conditioned group-invariant MDPs, and demonstrates improved performance in simulation and offline settings.

Findings

01

Outperforms baselines in simulated tasks

02

Enhances sample efficiency and spatial generalization

03

Effective in both state-based and image-based settings

Abstract

Contrastive Reinforcement Learning (CRL) provides a promising framework for extracting useful structured representations from unlabeled interactions. By pulling together state-action pairs and their corresponding future states, while pushing apart negative pairs, CRL enables learning nontrivial policies without manually designed rewards. In this work, we propose Equivariant CRL (ECRL), which further structures the latent space using equivariant constraints. By leveraging inherent symmetries in goal-conditioned manipulation tasks, our method improves both sample efficiency and spatial generalization. Specifically, we formally define Goal-Conditioned Group-Invariant MDPs to characterize rotation-symmetric robotic manipulation tasks, and build on this by introducing a novel rotation-invariant critic representation paired with a rotation-equivariant actor for Contrastive RL. Our approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsElevator Systems and Control · Reinforcement Learning in Robotics · Adaptive Dynamic Programming Control