Single-Loop Federated Actor-Critic across Heterogeneous Environments

Ye Zhu; Xiaowen Gong

arXiv:2412.14555·cs.LG·December 25, 2024

Single-Loop Federated Actor-Critic across Heterogeneous Environments

Ye Zhu, Xiaowen Gong

PDF

Open Access 1 Video

TL;DR

This paper introduces SFAC, a federated actor-critic algorithm for heterogeneous environments, providing theoretical convergence guarantees and demonstrating improved sample efficiency through federated learning.

Contribution

The paper presents the first theoretical analysis of federated actor-critic algorithms in heterogeneous environments, including convergence bounds and sample complexity analysis.

Findings

01

Convergence error asymptotically approaches a near-stationary point.

02

Sample complexity benefits from linear speed-up with more agents.

03

Numerical experiments confirm the effectiveness of SFAC.

Abstract

Federated reinforcement learning (FRL) has emerged as a promising paradigm, enabling multiple agents to collaborate and learn a shared policy adaptable across heterogeneous environments. Among the various reinforcement learning (RL) algorithms, the actor-critic (AC) algorithm stands out for its low variance and high sample efficiency. However, little to nothing is known theoretically about AC in a federated manner, especially each agent interacts with a potentially different environment. The lack of such results is attributed to various technical challenges: a two-level structure illustrating the coupling effect between the actor and the critic, heterogeneous environments, Markovian sampling and multiple local updates. In response, we study \textit{Single-loop Federated Actor Critic} (SFAC) where agents perform actor-critic learning in a two-level federated manner while interacting with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Single-Loop Federated Actor-Critic across Heterogeneous Environments· underline

Taxonomy

TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Cloud Computing and Resource Management