The Gradient Convergence Bound of Federated Multi-Agent Reinforcement   Learning with Efficient Communication

Xing Xu; Rongpeng Li; Zhifeng Zhao; Honggang Zhang

arXiv:2103.13026·cs.LG·May 30, 2023

The Gradient Convergence Bound of Federated Multi-Agent Reinforcement Learning with Efficient Communication

Xing Xu, Rongpeng Li, Zhifeng Zhao, Honggang Zhang

PDF

Open Access

TL;DR

This paper introduces new optimization schemes for federated multi-agent reinforcement learning that reduce communication costs and improve convergence, especially in heterogeneous and asynchronous environments, backed by theoretical guarantees and simulations.

Contribution

It proposes decay-based and consensus-based gradient schemes with convergence guarantees, enhancing efficiency and robustness in federated multi-agent RL.

Findings

01

Decay-based scheme improves convergence speed.

02

Consensus-based scheme enhances communication efficiency.

03

Both schemes outperform existing methods in simulations.

Abstract

The paper considers independent reinforcement learning (IRL) for multi-agent collaborative decision-making in the paradigm of federated learning (FL). However, FL generates excessive communication overheads between agents and a remote central server, especially when it involves a large number of agents or iterations. Besides, due to the heterogeneity of independent learning environments, multiple agents may undergo asynchronous Markov decision processes (MDPs), which will affect the training samples and the model's convergence performance. On top of the variation-aware periodic averaging (VPA) method and the policy-based deep reinforcement learning (DRL) algorithm (i.e., proximal policy optimization (PPO)), this paper proposes two advanced optimization schemes orienting to stochastic gradient descent (SGD): 1) A decay-based scheme gradually decays the weights of a model's local…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Reinforcement Learning in Robotics · Distributed Control Multi-Agent Systems