Equivariant Reinforcement Learning under Partial Observability

Hai Nguyen; Andrea Baisero; David Klee; Dian Wang; Robert Platt,; Christopher Amato

arXiv:2408.14336·cs.RO·August 27, 2024·2 cites

Equivariant Reinforcement Learning under Partial Observability

Hai Nguyen, Andrea Baisero, David Klee, Dian Wang, Robert Platt,, Christopher Amato

PDF

Open Access

TL;DR

This paper introduces equivariant reinforcement learning agents that leverage symmetries in partially observable environments, leading to more sample-efficient and effective robotic learning in simulation and real-world tasks.

Contribution

It proposes encoding equivariance into neural networks for reinforcement learning, improving sample efficiency and performance in partially observable robotic domains.

Findings

01

Equivariant agents outperform non-equivariant methods in sample efficiency.

02

Significant performance improvements demonstrated in robotic simulation tasks.

03

Effective transfer of solutions across related scenarios using symmetry encoding.

Abstract

Incorporating inductive biases is a promising approach for tackling challenging robot learning domains with sample-efficient solutions. This paper identifies partially observable domains where symmetries can be a useful inductive bias for efficient learning. Specifically, by encoding the equivariance regarding specific group symmetries into the neural networks, our actor-critic reinforcement learning agents can reuse solutions in the past for related scenarios. Consequently, our equivariant agents outperform non-equivariant approaches significantly in terms of sample efficiency and final performance, demonstrated through experiments on a range of robotic tasks in simulation and real hardware.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsElevator Systems and Control · Fault Detection and Control Systems · Software Reliability and Analysis Research