Adding vs. Averaging in Distributed Primal-Dual Optimization
Chenxin Ma, Virginia Smith, Martin Jaggi, Michael I. Jordan, Peter, Richt\'arik, Martin Tak\'a\v{c}

TL;DR
This paper introduces CoCoA+, a generalized distributed optimization framework that allows additive updates instead of averaging, leading to faster convergence and better scalability in large-scale machine learning tasks.
Contribution
We propose CoCoA+, a novel extension of the CoCoA framework that enables additive updates, with improved convergence guarantees and applicability to non-smooth convex functions.
Findings
CoCoA+ outperforms previous methods on real-world datasets.
Additive updates improve scalability with more machines.
Theoretical convergence guarantees are strengthened for CoCoA+.
Abstract
Distributed optimization methods for large-scale machine learning suffer from a communication bottleneck. It is difficult to reduce this bottleneck while still efficiently and accurately aggregating partial work from different machines. In this paper, we present a novel generalization of the recent communication-efficient primal-dual framework (CoCoA) for distributed optimization. Our framework, CoCoA+, allows for additive combination of local updates to the global parameters at each iteration, whereas previous schemes with convergence guarantees only allow conservative averaging. We give stronger (primal-dual) convergence rate guarantees for both CoCoA as well as our new variants, and generalize the theory for both methods to cover non-smooth convex loss functions. We provide an extensive experimental comparison that shows the markedly improved performance of CoCoA+ on several…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research
