Federated TD Learning over Finite-Rate Erasure Channels: Linear Speedup under Markovian Sampling

Nicolò Dal Fabbro; Aritra Mitra; George J. Pappas

arXiv:2305.08104 · cs.LG · May 16, 2023 · 1 citation

Open Access

TL;DR

This paper introduces QFedTD, a federated reinforcement learning algorithm that accounts for communication constraints, demonstrating a linear speedup in policy evaluation with finite-sample guarantees under Markovian sampling.

Contribution

It provides the first non-asymptotic analysis of quantization and erasure effects in federated reinforcement learning, establishing linear speedup guarantees.

Findings

01. QFedTD achieves linear speedup with respect to the number of agents.

02. Quantization and packet erasures impact convergence rates.

03. First finite-sample analysis of communication constraints in federated RL.

Abstract

Federated learning (FL) has recently gained much attention due to its effectiveness in speeding up supervised learning tasks under communication and privacy constraints. However, whether similar speedups can be established for reinforcement learning remains much less understood theoretically. Towards this direction, we study a federated policy evaluation problem where agents communicate via a central aggregator to expedite the evaluation of a common policy. To capture typical communication constraints in FL, we consider finite capacity up-link channels that can drop packets based on a Bernoulli erasure model. Given this setting, we propose and analyze QFedTD - a quantized federated temporal difference learning algorithm with linear function approximation. Our main technical contribution is to provide a finite-sample analysis of QFedTD that (i) highlights the effect of quantization and…
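The setting described in the abstract can be illustrated with a minimal sketch. It is not the paper's QFedTD algorithm. The 3-state Markov reward process, the feature vectors, the uniform clipped quantizer, the step size, and the rule of averaging over whichever packets arrive are all assumptions chosen for illustration, and every function name is hypothetical. Each agent follows its own Markov trajectory, computes a TD(0) direction with linear function approximation, quantizes it to a finite number of bits, and sends it over an up-link that drops the packet according to a Bernoulli erasure model.

```python
import random


def td_direction(theta, phi_s, reward, phi_next, gamma):
    """TD(0) update direction under linear function approximation."""
    v_s = sum(t * f for t, f in zip(theta, phi_s))
    v_next = sum(t * f for t, f in zip(theta, phi_next))
    delta = reward + gamma * v_next - v_s  # temporal-difference error
    return [delta * f for f in phi_s]


def quantize(vec, bits, radius):
    """Assumed quantizer: clip each coordinate to [-radius, radius] and
    round it to the nearest of 2**bits uniformly spaced levels."""
    levels = 2 ** bits - 1
    step = 2.0 * radius / levels
    out = []
    for x in vec:
        x = max(-radius, min(radius, x))
        out.append(-radius + step * round((x + radius) / step))
    return out


def run_sketch(num_agents=10, num_rounds=2000, bits=8,
               p_success=0.7, alpha=0.05, gamma=0.9, seed=0):
    """Toy federated TD loop over quantized, lossy up-links (illustrative)."""
    rng = random.Random(seed)
    # Hypothetical Markov reward process induced by the common policy.
    P = [[0.5, 0.5, 0.0],
         [0.1, 0.6, 0.3],
         [0.3, 0.0, 0.7]]
    R = [1.0, 0.0, -1.0]
    PHI = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]  # one feature row per state
    theta = [0.0, 0.0]
    # Each agent starts its own trajectory (Markovian sampling).
    states = [rng.randrange(3) for _ in range(num_agents)]
    for _ in range(num_rounds):
        received = []
        for i in range(num_agents):
            s = states[i]
            s_next = rng.choices(range(3), weights=P[s])[0]
            g = td_direction(theta, PHI[s], R[s], PHI[s_next], gamma)
            states[i] = s_next
            # Bernoulli erasure: the packet arrives with probability p_success.
            if rng.random() < p_success:
                received.append(quantize(g, bits, radius=10.0))
        if received:
            # Assumed aggregation rule: average over the packets that arrived.
            avg = [sum(col) / len(received) for col in zip(*received)]
            theta = [t + alpha * a for t, a in zip(theta, avg)]
    return theta


if __name__ == "__main__":
    print(run_sketch())
```

Varying `num_agents`, `bits`, and `p_success` in this toy loop gives a rough feel for how the number of agents, the bit budget, and the erasure probability interact. The finite-sample rates themselves come from the paper's analysis, not from this sketch.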

Taxonomy

Topics: Cooperative Communication and Network Coding · Advanced MIMO Systems Optimization · Age of Information Optimization