Hierarchical Federated Learning with Multi-Timescale Gradient Correction

Wenzhi Fang; Dong-Jun Han; Evan Chen; Shiqiang Wang; and Christopher; G. Brinton

arXiv:2409.18448·cs.LG·December 18, 2024

Hierarchical Federated Learning with Multi-Timescale Gradient Correction

Wenzhi Fang, Dong-Jun Han, Evan Chen, Shiqiang Wang, and Christopher, G. Brinton

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a multi-timescale gradient correction method for hierarchical federated learning, effectively addressing model drift across multiple levels of data heterogeneity and demonstrating stability and improved convergence.

Contribution

It proposes a novel gradient correction approach for HFL that handles multi-level data heterogeneity and provides theoretical convergence guarantees.

Findings

01

Convergence bound is immune to data heterogeneity.

02

Method effectively reduces model drift at multiple hierarchy levels.

03

Validated through extensive experiments on various datasets.

Abstract

While traditional federated learning (FL) typically focuses on a star topology where clients are directly connected to a central server, real-world distributed systems often exhibit hierarchical architectures. Hierarchical FL (HFL) has emerged as a promising solution to bridge this gap, leveraging aggregation points at multiple levels of the system. However, existing algorithms for HFL encounter challenges in dealing with multi-timescale model drift, i.e., model drift occurring across hierarchical levels of data heterogeneity. In this paper, we propose a multi-timescale gradient correction (MTGC) methodology to resolve this issue. Our key idea is to introduce distinct control variables to (i) correct the client gradient towards the group gradient, i.e., to reduce client model drift caused by local updates based on individual datasets, and (ii) correct the group gradient towards the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wenzhifang/mtgc
pytorchOfficial

Videos

Hierarchical Federated Learning with Multi-Timescale Gradient Correction· slideslive

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Brain Tumor Detection and Classification