How a Small Amount of Data Sharing Benefits Distributed Optimization and Learning : The Upside of Data Heterogeneity
Mingxi Zhu, Yinyu Ye

TL;DR
This paper demonstrates that sharing a small amount of data in distributed optimization can significantly enhance convergence, especially in primal-dual methods, challenging the belief that heterogeneity is always harmful.
Contribution
It provides a theoretical analysis of how minimal data sharing benefits different classes of distributed algorithms and introduces a meta-algorithm for practical implementation.
Findings
Minimal data sharing improves algorithmic performance.
Heterogeneity can accelerate primal-dual algorithms.
As little as 1% data sharing significantly speeds up convergence.
Abstract
Distributed optimization algorithms are widely used in machine learning. This paper investigates how a small amount of data sharing can improve their performance. Focusing on general linear models, we analyze the effects of data sharing on both primal and primal-dual optimization methods. Our contributions are threefold. First, from a theoretical perspective, we show that minimal data sharing improves algorithmic performance by shifting data from less favorable to more favorable structures. Contrary to the common belief that data heterogeneity is always harmful, we prove that while heterogeneity generally slows convergence in primal methods such as FedAvg and distributed PCG, it can accelerate convergence in primal-dual consensus algorithms like distributed ADMM, Fed-ADMM, and EXTRA by enriching dual dynamics. This reveals a form of duality in how heterogeneity affects different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Machine Learning and ELM · Stochastic Gradient Optimization Techniques
