Heterogeneity-Aware Coordination for Federated Learning via Stitching Pre-trained blocks
Shichen Zhan, Yebo Wu, Chunlin Tian, Yan Zhao, Li Li

TL;DR
FedStitch introduces a heterogeneity-aware federated learning framework that stitches pre-trained blocks, significantly reducing memory and energy costs while improving model accuracy in diverse device environments.
Contribution
It proposes a novel stitching-based approach for federated learning using pre-trained blocks, incorporating RL-based aggregation and energy optimization.
Findings
Model accuracy improved by up to 20.93%
Memory footprint reduced by up to 79.5%
Energy consumption decreased by 89.41%
Abstract
Federated learning (FL) coordinates multiple devices to collaboratively train a shared model while preserving data privacy. However, large memory footprint and high energy consumption during the training process excludes the low-end devices from contributing to the global model with their own data, which severely deteriorates the model performance in real-world scenarios. In this paper, we propose FedStitch, a hierarchical coordination framework for heterogeneous federated learning with pre-trained blocks. Unlike the traditional approaches that train the global model from scratch, for a new task, FedStitch composes the global model via stitching pre-trained blocks. Specifically, each participating client selects the most suitable block based on their local data from the candidate pool composed of blocks from pre-trained models. The server then aggregates the optimal block for stitching.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Graph Neural Networks
