Distributed Machine Learning via Sufficient Factor Broadcasting
Pengtao Xie, Jin Kyu Kim, Yi Zhou, Qirong Ho, Abhimanu Kumar, Yaoliang, Yu, Eric Xing

TL;DR
This paper introduces Sufficient Factor Broadcasting, a distributed learning method that reduces communication costs for large matrix-parametrized models by broadcasting rank-1 updates, enabling efficient large-scale machine learning.
Contribution
It proposes a novel SFB model that leverages rank-1 updates for efficient distributed training of matrix-parameterized models, with proven convergence and empirical validation.
Findings
Reduces communication costs from quadratic to linear in parameter dimensions.
Achieves efficient distributed training for large-scale models.
Demonstrates effectiveness on four different ML models.
Abstract
Matrix-parametrized models, including multiclass logistic regression and sparse coding, are used in machine learning (ML) applications ranging from computer vision to computational biology. When these models are applied to large-scale ML problems starting at millions of samples and tens of thousands of classes, their parameter matrix can grow at an unexpected rate, resulting in high parameter synchronization costs that greatly slow down distributed learning. To address this issue, we propose a Sufficient Factor Broadcasting (SFB) computation model for efficient distributed learning of a large family of matrix-parameterized models, which share the following property: the parameter update computed on each data sample is a rank-1 matrix, i.e., the outer product of two "sufficient factors" (SFs). By broadcasting the SFs among worker machines and reconstructing the update matrices locally at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced MIMO Systems Optimization · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques
MethodsLogistic Regression
