FedFormer: Contextual Federation with Attention in Reinforcement   Learning

Liam Hebert; Lukasz Golab; Pascal Poupart; Robin Cohen

arXiv:2205.13697·cs.LG·July 19, 2023·1 cites

FedFormer: Contextual Federation with Attention in Reinforcement Learning

Liam Hebert, Lukasz Golab, Pascal Poupart, Robin Cohen

PDF

Open Access 1 Repo

TL;DR

FedFormer introduces a Transformer-based attention mechanism for federated reinforcement learning, enabling more effective aggregation of multi-agent insights and outperforming traditional averaging methods in complex environments.

Contribution

This paper presents FedFormer, a novel federation strategy using Transformer Attention to contextually aggregate agent models, improving performance in multi-agent reinforcement learning.

Findings

01

FedFormer outperforms FedAvg and single-agent methods in Meta-World tasks.

02

Higher episodic returns achieved with FedFormer while maintaining privacy constraints.

03

Effectiveness increases with larger agent pools, unlike FedAvg.

Abstract

A core issue in multi-agent federated reinforcement learning is defining how to aggregate insights from multiple agents. This is commonly done by taking the average of each participating agent's model weights into one common model (FedAvg). We instead propose FedFormer, a novel federation strategy that utilizes Transformer Attention to contextually aggregate embeddings from models originating from different learner agents. In so doing, we attentively weigh the contributions of other agents with respect to the current agent's environment and learned relationships, thus providing a more effective and efficient federation. We evaluate our methods on the Meta-World environment and find that our approach yields significant improvements over FedAvg and non-federated Soft Actor-Critic single-agent methods. Our results compared to Soft Actor-Critic show that FedFormer achieves higher episodic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liamhebert/FedFormer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Reinforcement Learning in Robotics · Ethics and Social Impacts of AI

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Softmax · Dense Connections · Absolute Position Encodings · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer