Heterogeneous Federated Reinforcement Learning Using Wasserstein Barycenters
Luiz Pereira, M. Hadi Amini

TL;DR
This paper introduces FedWB, a federated learning algorithm using Wasserstein barycenters for model aggregation, and extends it to heterogeneous reinforcement learning, demonstrated on CartPole with varying pole lengths.
Contribution
The paper presents a novel Wasserstein barycenter-based aggregation method and applies it to heterogeneous federated reinforcement learning, addressing model fusion in diverse environments.
Findings
Wasserstein barycenter aggregation improves model fusion quality.
FedWB effectively handles heterogeneous environments in reinforcement learning.
Global DQN generalizes across varied CartPole setups.
Abstract
In this paper, we first propose a novel algorithm for model fusion that leverages Wasserstein barycenters in training a global Deep Neural Network (DNN) in a distributed architecture. To this end, we divide the dataset into equal parts that are fed to "agents" who have identical deep neural networks and train only over the dataset fed to them (known as the local dataset). After some training iterations, we perform an aggregation step where we combine the weight parameters of all neural networks using Wasserstein barycenters. These steps form the proposed algorithm referred to as FedWB. Moreover, we leverage the processes created in the first part of the paper to develop an algorithm to tackle Heterogeneous Federated Reinforcement Learning (HFRL). Our test experiment is the CartPole toy problem, where we vary the lengths of the poles to create heterogeneous environments. We train a deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
