Hierarchical Federated Learning with Multi-Timescale Gradient Correction
Wenzhi Fang, Dong-Jun Han, Evan Chen, Shiqiang Wang, and Christopher, G. Brinton

TL;DR
This paper introduces a multi-timescale gradient correction method for hierarchical federated learning, effectively addressing model drift across multiple levels of data heterogeneity and demonstrating stability and improved convergence.
Contribution
It proposes a novel gradient correction approach for HFL that handles multi-level data heterogeneity and provides theoretical convergence guarantees.
Findings
Convergence bound is immune to data heterogeneity.
Method effectively reduces model drift at multiple hierarchy levels.
Validated through extensive experiments on various datasets.
Abstract
While traditional federated learning (FL) typically focuses on a star topology where clients are directly connected to a central server, real-world distributed systems often exhibit hierarchical architectures. Hierarchical FL (HFL) has emerged as a promising solution to bridge this gap, leveraging aggregation points at multiple levels of the system. However, existing algorithms for HFL encounter challenges in dealing with multi-timescale model drift, i.e., model drift occurring across hierarchical levels of data heterogeneity. In this paper, we propose a multi-timescale gradient correction (MTGC) methodology to resolve this issue. Our key idea is to introduce distinct control variables to (i) correct the client gradient towards the group gradient, i.e., to reduce client model drift caused by local updates based on individual datasets, and (ii) correct the group gradient towards the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Brain Tumor Detection and Classification
