Heterogeneity-Aware Coordination for Federated Learning via Stitching   Pre-trained blocks

Shichen Zhan; Yebo Wu; Chunlin Tian; Yan Zhao; Li Li

arXiv:2409.07202·cs.LG·September 12, 2024

Heterogeneity-Aware Coordination for Federated Learning via Stitching Pre-trained blocks

Shichen Zhan, Yebo Wu, Chunlin Tian, Yan Zhao, Li Li

PDF

Open Access

TL;DR

FedStitch introduces a heterogeneity-aware federated learning framework that stitches pre-trained blocks, significantly reducing memory and energy costs while improving model accuracy in diverse device environments.

Contribution

It proposes a novel stitching-based approach for federated learning using pre-trained blocks, incorporating RL-based aggregation and energy optimization.

Findings

01

Model accuracy improved by up to 20.93%

02

Memory footprint reduced by up to 79.5%

03

Energy consumption decreased by 89.41%

Abstract

Federated learning (FL) coordinates multiple devices to collaboratively train a shared model while preserving data privacy. However, large memory footprint and high energy consumption during the training process excludes the low-end devices from contributing to the global model with their own data, which severely deteriorates the model performance in real-world scenarios. In this paper, we propose FedStitch, a hierarchical coordination framework for heterogeneous federated learning with pre-trained blocks. Unlike the traditional approaches that train the global model from scratch, for a new task, FedStitch composes the global model via stitching pre-trained blocks. Specifically, each participating client selects the most suitable block based on their local data from the candidate pool composed of blocks from pre-trained models. The server then aggregates the optimal block for stitching.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Graph Neural Networks