Sky Computing: Accelerating Geo-distributed Computing in Federated Learning
Jie Zhu, Shenggui Li, Yang You

TL;DR
Sky Computing introduces a load-balanced model parallelism framework for federated learning, optimizing weight allocation based on device capabilities, significantly reducing training time for large models across distributed devices.
Contribution
The paper proposes Sky Computing, a novel load-balanced model parallelism framework that adaptively allocates model weights according to device capabilities in federated learning.
Findings
Outperforms baseline by 55% in training time for 160-layer BERT
Effectively balances load across heterogeneous devices
Reduces training time significantly in federated settings
Abstract
Federated learning is proposed by Google to safeguard data privacy through training models locally on users' devices. However, with deep learning models growing in size to achieve better results, it becomes increasingly difficult to accommodate the whole model on one single device. Thus, model parallelism is then used to divide the model weights among several devices. With this logic, the approach currently used evenly allocates weights among devices. However, in reality, a computation bottleneck may occur resulting from variant computing power of different users' devices. To address this problem, load balancing is needed to allocate the model weights based on the computational capability of the device. In this paper, we proposed Sky Computing, a load-balanced model parallelism framework to adaptively allocate the weights to devices. Sky Computing outperforms the baseline method by 55%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Recommender Systems and Techniques · Human Mobility and Location-Based Analysis
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dense Connections · Softmax · Residual Connection · Weight Decay · Linear Warmup With Linear Decay · WordPiece · Layer Normalization
