On the Convergence Rates of Federated Q-Learning across Heterogeneous Environments
Leo Muxing Wang, Pengkun Yang, Lili Su

TL;DR
This paper analyzes the convergence rates of federated Q-learning in heterogeneous environments, revealing fundamental limitations and phase-transition phenomena affecting performance and suggesting strategies for improvement.
Contribution
It provides a detailed characterization of error dynamics in federated Q-learning under heterogeneity and identifies the impact of multiple local updates on convergence speed.
Findings
Linear speed-up in error reduction with respect to number of agents K.
Performance degradation when multiple local updates E > 1.
Existence of a phase transition in convergence behavior.
Abstract
Large-scale multi-agent systems are often deployed across wide geographic areas, where agents interact with heterogeneous environments. There is an emerging interest in understanding the role of heterogeneity in the performance of the federated versions of classic reinforcement learning algorithms. In this paper, we study synchronous federated Q-learning, which aims to learn an optimal Q-function by having agents average their local Q-estimates per iterations. We observe an interesting phenomenon on the convergence speeds in terms of and . Similar to the homogeneous environment settings, there is a linear speed-up concerning in reducing the errors that arise from sampling randomness. Yet, in sharp contrast to the homogeneous settings, leads to significant performance degradation. Specifically, we provide a fine-grained characterization of the error evolution in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Distributed Sensor Networks and Detection Algorithms · Face and Expression Recognition
