Convergence Guarantees for Federated SARSA with Local Training and Heterogeneous Agents

Paul Mangold; Elo\"ise Berthier; Eric Moulines

arXiv:2512.17688·cs.LG·February 5, 2026

Convergence Guarantees for Federated SARSA with Local Training and Heterogeneous Agents

Paul Mangold, Elo\"ise Berthier, Eric Moulines

PDF

Open Access

TL;DR

This paper provides the first convergence analysis and complexity bounds for Federated SARSA with heterogeneous agents, demonstrating linear speed-up and supporting the theory with numerical experiments.

Contribution

It introduces a novel theoretical framework for Federated SARSA with heterogeneity, including a new multi-step error expansion and convergence guarantees.

Findings

01

FedSARSA converges despite heterogeneity in local data.

02

The method achieves linear speed-up with the number of agents.

03

Numerical results validate the theoretical analysis.

Abstract

We present a novel theoretical analysis of Federated SARSA (FedSARSA) with linear function approximation and local training. We establish convergence guarantees for FedSARSA in the presence of heterogeneity, both in local transitions and rewards, providing the first sample and communication complexity bounds in this setting. At the core of our analysis is a new, exact multi-step error expansion for single-agent SARSA, which is of independent interest. Our analysis precisely quantifies the impact of heterogeneity, demonstrating the convergence of FedSARSA with multiple local updates. Crucially, we show that FedSARSA achieves linear speed-up with respect to the number of agents, up to higher-order terms due to Markovian sampling. Numerical experiments support our theoretical findings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Age of Information Optimization · Wireless Communication Security Techniques