Timely Asynchronous Hierarchical Federated Learning: Age of Convergence
Purbesh Mitra, Sennur Ulukus

TL;DR
This paper analyzes the convergence behavior of asynchronous hierarchical federated learning systems, demonstrating probabilistic guarantees of finite-time convergence under specific clustering conditions to optimize training timeliness.
Contribution
It introduces convergence criteria for AHFL systems with dense clusters, highlighting the importance of sparse edge server deployment for guaranteed convergence.
Findings
Finite-time convergence is probabilistically guaranteed in dense client clusters.
System convergence depends on the number of clusters being constant (O(1)).
Sparse edge server deployment enhances convergence guarantees.
Abstract
We consider an asynchronous hierarchical federated learning (AHFL) setting with a client-edge-cloud framework. The clients exchange the trained parameters with their corresponding edge servers, which update the locally aggregated model. This model is then transmitted to all the clients in the local cluster. The edge servers communicate to the central cloud server for global model aggregation. The goal of each client is to converge to the global model, while maintaining timeliness of the clients, i.e., having optimum training iteration time. We investigate the convergence criteria for such a system with dense clusters. Our analysis shows that for a system of clients with fixed average timeliness, the convergence in finite time is probabilistically guaranteed, if the nodes are divided into number of clusters, that is, if the system is built as a sparse set of edge servers with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAge of Information Optimization · Privacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques
