On the Convergence Rates of Federated Q-Learning across Heterogeneous Environments

Leo Muxing Wang; Pengkun Yang; Lili Su

arXiv:2409.03897·cs.LG·May 18, 2026

TL;DR

This paper analyzes the convergence rates of federated Q-learning in heterogeneous environments, revealing fundamental limitations and phase-transition phenomena affecting performance and suggesting strategies for improvement.

Contribution

It provides a detailed characterization of error dynamics in federated Q-learning under heterogeneity and identifies the impact of multiple local updates on convergence speed.

Findings

01. Errors arising from sampling randomness shrink with a linear speed-up in the number of agents K.

02. Performing multiple local updates between averaging rounds (E > 1) leads to significant performance degradation.

03. The convergence behavior exhibits a phase transition.

Abstract

Large-scale multi-agent systems are often deployed across wide geographic areas, where agents interact with heterogeneous environments. There is an emerging interest in understanding the role of heterogeneity in the performance of the federated versions of classic reinforcement learning algorithms. In this paper, we study synchronous federated Q-learning, which aims to learn an optimal Q-function by having $K$ agents average their local Q-estimates per $E$ iterations. We observe an interesting phenomenon on the convergence speeds in terms of $K$ and $E$. Similar to the homogeneous environment settings, there is a linear speed-up concerning $K$ in reducing the errors that arise from sampling randomness. Yet, in sharp contrast to the homogeneous settings, $E > 1$ leads to significant performance degradation. Specifically, we provide a fine-grained characterization of the error evolution in…
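The averaging scheme described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not specify the details, so the following are assumptions made only for illustration: heterogeneity enters through per-agent transition kernels, all agents share one reward table with values in [0, 1], the step size is constant, and every state-action pair is updated in each iteration from one sampled next state. The names `make_kernel` and `federated_q_learning` are hypothetical.

```python
import random


def make_kernel(n_states, n_actions, rng):
    """Hypothetical helper: random kernel, kernel[s][a] is a distribution over next states."""
    kernel = []
    for _ in range(n_states):
        row = []
        for _ in range(n_actions):
            weights = [rng.random() for _ in range(n_states)]
            total = sum(weights)
            row.append([w / total for w in weights])
        kernel.append(row)
    return kernel


def federated_q_learning(kernels, reward, gamma, step_size, E, T, seed=0):
    """K agents run local synchronous Q-learning and average their tables every E iterations.

    Assumptions (not taken from the paper): shared reward table, agent-specific
    transition kernels, constant step size, zero initialization.
    """
    rng = random.Random(seed)
    K = len(kernels)
    S, A = len(reward), len(reward[0])
    states = list(range(S))
    # One Q-table per agent.
    Q = [[[0.0] * A for _ in range(S)] for _ in range(K)]
    for t in range(1, T + 1):
        for k in range(K):
            old = Q[k]
            new = [row[:] for row in old]
            for s in range(S):
                for a in range(A):
                    # Agent k samples a next state from its own environment.
                    s_next = rng.choices(states, weights=kernels[k][s][a])[0]
                    target = reward[s][a] + gamma * max(old[s_next])
                    new[s][a] = (1 - step_size) * old[s][a] + step_size * target
            Q[k] = new
        if t % E == 0:
            # Averaging round: every agent restarts from the mean table.
            avg = [[sum(Q[k][s][a] for k in range(K)) / K for a in range(A)]
                   for s in range(S)]
            Q = [[row[:] for row in avg] for _ in range(K)]
    return Q


# Usage: 4 agents, 3 states, 2 actions, averaging every 5 iterations.
setup_rng = random.Random(1)
kernels = [make_kernel(3, 2, setup_rng) for _ in range(4)]
reward = [[setup_rng.random() for _ in range(2)] for _ in range(3)]
Q = federated_q_learning(kernels, reward, gamma=0.9, step_size=0.1, E=5, T=100)
```

Setting `E=1` averages after every iteration, while larger `E` lets each agent's table drift toward its own environment between averaging rounds, which is the regime the abstract associates with performance degradation.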

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Distributed Sensor Networks and Detection Algorithms · Face and Expression Recognition