MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning

Elise van der Pol; Daniel E. Worrall; Herke van Hoof; Frans A.; Oliehoek; Max Welling

arXiv:2006.16908·cs.LG·January 21, 2021·47 cites

MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning

Elise van der Pol, Daniel E. Worrall, Herke van Hoof, Frans A., Oliehoek, Max Welling

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper presents MDP homomorphic networks that incorporate group symmetry equivariance into reinforcement learning models, leading to faster convergence by reducing solution complexity.

Contribution

It introduces a method to embed group-structured symmetries into neural networks for reinforcement learning, including an easy construction of equivariant layers.

Findings

01

Faster convergence on CartPole, grid world, and Pong.

02

Networks are equivariant under reflections and rotations.

03

Reduces solution space by leveraging symmetry constraints.

Abstract

This paper introduces MDP homomorphic networks for deep reinforcement learning. MDP homomorphic networks are neural networks that are equivariant under symmetries in the joint state-action space of an MDP. Current approaches to deep reinforcement learning do not usually exploit knowledge about such structure. By building this prior knowledge into policy and value networks using an equivariance constraint, we can reduce the size of the solution space. We specifically focus on group-structured symmetries (invertible transformations). Additionally, we introduce an easy method for constructing equivariant network layers numerically, so the system designer need not solve the constraints by hand, as is typically done. We construct MDP homomorphic MLPs and CNNs that are equivariant under either a group of reflections or rotations. We show that such networks converge faster than unstructured…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Neural dynamics and brain function · Protein Structure and Dynamics