Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement   Learning

Chenyu Zhang; Han Wang; Aritra Mitra; James Anderson

arXiv:2401.15273·cs.LG·April 16, 2024·1 cites

Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning

Chenyu Zhang, Han Wang, Aritra Mitra, James Anderson

PDF

Open Access

TL;DR

This paper introduces FedSARSA, a federated on-policy reinforcement learning algorithm with linear function approximation, providing finite-time performance guarantees and demonstrating near-optimal convergence and linear speedup benefits in heterogeneous multi-agent environments.

Contribution

The paper presents FedSARSA, the first finite-time analysis of on-policy heterogeneous federated reinforcement learning with convergence guarantees and linear speedup results.

Findings

01

FedSARSA converges to near-optimal policies for all agents.

02

The algorithm achieves linear speedup with increasing number of agents.

03

Performance degrades gracefully with heterogeneity levels.

Abstract

Federated reinforcement learning (FRL) has emerged as a promising paradigm for reducing the sample complexity of reinforcement learning tasks by exploiting information from different agents. However, when each agent interacts with a potentially different environment, little to nothing is known theoretically about the non-asymptotic performance of FRL algorithms. The lack of such results can be attributed to various technical challenges and their intricate interplay: Markovian sampling, linear function approximation, multiple local updates to save communication, heterogeneity in the reward functions and transition kernels of the agents' MDPs, and continuous state-action spaces. Moreover, in the on-policy setting, the behavior policies vary with time, further complicating the analysis. In response, we introduce FedSARSA, a novel federated on-policy reinforcement learning scheme, equipped…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management · Transportation and Mobility Innovations · Reinforcement Learning in Robotics