TL;DR
This paper introduces a distributed learning algorithm that reduces communication by triggering exchanges only when necessary, is robust to data heterogeneity, and outperforms existing methods on standard datasets.
Contribution
It proposes an event-based ADMM algorithm for distributed learning that minimizes communication and guarantees convergence regardless of data distribution.
Findings
Achieves over 35% communication savings.
Demonstrates robustness to heterogeneous data.
Outperforms FedAvg, FedProx, SCAFFOLD, and FedADMM.
Abstract
We consider a distributed learning problem, where agents minimize a global objective function by exchanging information over a network. Our approach has two distinct features: (i) It substantially reduces communication by triggering communication only when necessary, and (ii) it is agnostic to the data-distribution among the different agents. We therefore guarantee convergence even if the local data-distributions of the agents are arbitrarily distinct. We analyze the convergence rate of the algorithm both in convex and nonconvex settings and derive accelerated convergence rates for the convex case. We also characterize the effect of communication failures and demonstrate that our algorithm is robust to these. The article concludes by presenting numerical results from distributed learning tasks on the MNIST and CIFAR-10 datasets. The experiments underline communication savings of 35% or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
