Momentum-Based Federated Reinforcement Learning with Interaction and Communication Efficiency

Sheng Yue; Xingyuan Hua; Lili Chen; Ju Ren

arXiv:2405.17471·cs.LG·May 30, 2024

TL;DR

This paper presents MFPO, a federated reinforcement learning algorithm that improves interaction and communication efficiency using momentum, importance sampling, and server adjustments, achieving near-optimal complexities and better performance.

Contribution

The paper introduces MFPO, a novel FRL algorithm that reduces interaction and communication costs while maintaining high performance, with theoretical complexity guarantees and extensive experimental validation.

Findings

1. MFPO's interaction complexity achieves linear speedup with the number of agents.

2. MFPO's communication complexity, $\tilde{O}(\epsilon^{-1})$, matches the best known for first-order FL algorithms.

3. Experiments show significant performance improvements over existing methods.

Abstract

Federated Reinforcement Learning (FRL) has garnered increasing attention recently. However, due to the intrinsic spatio-temporal non-stationarity of data distributions, the current approaches typically suffer from high interaction and communication costs. In this paper, we introduce a new FRL algorithm, named MFPO, that utilizes momentum, importance sampling, and additional server-side adjustment to control the shift of stochastic policy gradients and enhance the efficiency of data utilization. We prove that by proper selection of momentum parameters and interaction frequency, MFPO can achieve $\tilde{O}(H N^{-1} \epsilon^{-3/2})$ and $\tilde{O}(\epsilon^{-1})$ interaction and communication complexities, respectively ($N$ represents the number of agents), where the interaction complexity achieves linear speedup with the number of agents, and the communication…
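To make the combination of momentum and importance sampling mentioned in the abstract more concrete, here is a minimal single-agent sketch on a toy multi-armed bandit. It uses a generic STORM-style momentum estimator, which is an assumption on our part and may differ from MFPO's exact update; it includes no federated aggregation and no server-side adjustment. All names (`momentum_pg`, `reinforce_grad`, `beta`, `lr`) and the bandit setup are hypothetical and chosen only for illustration.

```python
import numpy as np

def softmax(theta):
    z = np.exp(theta - theta.max())
    return z / z.sum()

def reinforce_grad(theta, actions, rewards, weights=None):
    """Batch REINFORCE gradient estimate for a softmax policy.

    Optional per-sample importance weights correct for actions that
    were sampled under a different policy than softmax(theta).
    """
    pi = softmax(theta)
    onehot = np.eye(theta.size)[actions]
    score = onehot - pi  # gradient of log pi(a) w.r.t. theta
    coeff = rewards if weights is None else rewards * weights
    return (coeff[:, None] * score).mean(axis=0)

def momentum_pg(arm_rewards, steps=300, batch=32, lr=0.5, beta=0.2, seed=0):
    """Toy policy gradient ascent with a STORM-style momentum estimator.

    Assumed update (not taken from the paper):
        d_t = g(theta_t) + (1 - beta) * (d_{t-1} - w * g(theta_{t-1}))
    where both gradients use the same batch sampled under theta_t and
    w = pi_{theta_{t-1}}(a) / pi_{theta_t}(a) is the importance weight.
    """
    rng = np.random.default_rng(seed)
    k = arm_rewards.size
    theta = np.zeros(k)
    theta_prev = theta.copy()
    d = np.zeros(k)
    for t in range(steps):
        pi = softmax(theta)
        actions = rng.choice(k, size=batch, p=pi)
        rewards = arm_rewards[actions]
        g_now = reinforce_grad(theta, actions, rewards)
        if t == 0:
            d = g_now  # no previous iterate yet: plain gradient estimate
        else:
            # Reweight the current batch as if drawn from the old policy.
            w = softmax(theta_prev)[actions] / pi[actions]
            g_prev = reinforce_grad(theta_prev, actions, rewards, weights=w)
            d = g_now + (1.0 - beta) * (d - g_prev)
        theta_prev = theta.copy()
        theta = theta + lr * d  # gradient ascent on expected reward
    return softmax(theta)
```

Calling `momentum_pg(np.array([0.1, 0.5, 1.0]))` returns the learned action probabilities, which in this toy setting should concentrate on the highest-reward arm. The importance weight is what lets the previous iterate's gradient be evaluated on freshly sampled data, which is the general mechanism by which such estimators control the shift between consecutive stochastic policy gradients.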

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Traffic control and management