Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning
Chenyu Zhang, Han Wang, Aritra Mitra, James Anderson

TL;DR
This paper introduces FedSARSA, a federated on-policy reinforcement learning algorithm with linear function approximation, providing finite-time performance guarantees and demonstrating near-optimal convergence and linear speedup benefits in heterogeneous multi-agent environments.
Contribution
The paper presents FedSARSA, the first finite-time analysis of on-policy heterogeneous federated reinforcement learning with convergence guarantees and linear speedup results.
Findings
FedSARSA converges to near-optimal policies for all agents.
The algorithm achieves linear speedup with increasing number of agents.
Performance degrades gracefully with heterogeneity levels.
Abstract
Federated reinforcement learning (FRL) has emerged as a promising paradigm for reducing the sample complexity of reinforcement learning tasks by exploiting information from different agents. However, when each agent interacts with a potentially different environment, little to nothing is known theoretically about the non-asymptotic performance of FRL algorithms. The lack of such results can be attributed to various technical challenges and their intricate interplay: Markovian sampling, linear function approximation, multiple local updates to save communication, heterogeneity in the reward functions and transition kernels of the agents' MDPs, and continuous state-action spaces. Moreover, in the on-policy setting, the behavior policies vary with time, further complicating the analysis. In response, we introduce FedSARSA, a novel federated on-policy reinforcement learning scheme, equipped…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Transportation and Mobility Innovations · Reinforcement Learning in Robotics
