Boosting Sample Efficiency and Generalization in Multi-agent   Reinforcement Learning via Equivariance

Joshua McClellan; Naveed Haghani; John Winder; Furong Huang; Pratap; Tokekar

arXiv:2410.02581·cs.LG·October 23, 2024

Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance

Joshua McClellan, Naveed Haghani, John Winder, Furong Huang, Pratap, Tokekar

PDF

Open Access 1 Video

TL;DR

This paper introduces E2GN2, an equivariant graph neural network designed for multi-agent reinforcement learning, which significantly improves sample efficiency and generalization by leveraging symmetry structures, addressing exploration biases, and outperforming standard GNNs.

Contribution

The paper proposes E2GN2, a novel exploration-enhanced equivariant graph neural network that improves sample efficiency and generalization in multi-agent reinforcement learning.

Findings

01

E2GN2 outperforms standard GNNs in sample efficiency.

02

E2GN2 achieves greater final reward convergence.

03

E2GN2 shows 2x-5x better generalization in tests.

Abstract

Multi-Agent Reinforcement Learning (MARL) struggles with sample inefficiency and poor generalization [1]. These challenges are partially due to a lack of structure or inductive bias in the neural networks typically used in learning the policy. One such form of structure that is commonly observed in multi-agent scenarios is symmetry. The field of Geometric Deep Learning has developed Equivariant Graph Neural Networks (EGNN) that are equivariant (or symmetric) to rotations, translations, and reflections of nodes. Incorporating equivariance has been shown to improve learning efficiency and decrease error [ 2 ]. In this paper, we demonstrate that EGNNs improve the sample efficiency and generalization in MARL. However, we also show that a naive application of EGNNs to MARL results in poor early exploration due to a bias in the EGNN structure. To mitigate this bias, we present…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics