$\mathrm{SO}(2)$-Equivariant Reinforcement Learning

Dian Wang; Robin Walters; Robert Platt

arXiv:2203.04439·cs.RO·March 10, 2022·5 cites

$\mathrm{SO}(2)$-Equivariant Reinforcement Learning

Dian Wang, Robin Walters, Robert Platt

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces equivariant neural network architectures for reinforcement learning, specifically in robotic manipulation, demonstrating improved sample efficiency by leveraging symmetry properties in $Q$-learning and actor-critic methods.

Contribution

It proposes equivariant DQN and SAC algorithms that incorporate symmetry structures, enhancing learning efficiency in rotationally symmetric tasks.

Findings

01

Equivariant models outperform non-equivariant ones in sample efficiency.

02

The proposed algorithms effectively leverage symmetry in robotic manipulation.

03

Experimental results show significant improvements over competing methods.

Abstract

Equivariant neural networks enforce symmetry within the structure of their convolutional layers, resulting in a substantial improvement in sample efficiency when learning an equivariant or invariant function. Such models are applicable to robotic manipulation learning which can often be formulated as a rotationally symmetric problem. This paper studies equivariant model architectures in the context of $Q$ -learning and actor-critic reinforcement learning. We identify equivariant and invariant characteristics of the optimal $Q$ -function and the optimal policy and propose equivariant DQN and SAC algorithms that leverage this structure. We present experiments that demonstrate that our equivariant versions of DQN and SAC can be significantly more sample efficient than competing algorithms on an important class of robotic manipulation problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pointW/equi_rl
pytorchOfficial

Videos

$\mathrm{SO}(2)$-Equivariant Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Model Reduction and Neural Networks