Communication Efficient Distributed Optimization using an Approximate Newton-type Method
Ohad Shamir, Nathan Srebro, Tong Zhang

TL;DR
This paper introduces a new Newton-type method for distributed optimization that converges quickly and scales well with data size, especially effective for stochastic learning problems.
Contribution
It proposes a novel Newton-type algorithm tailored for distributed stochastic optimization with proven linear convergence that improves with data size.
Findings
Achieves linear convergence rate for quadratic objectives
Requires an essentially constant number of iterations as data size grows
Outperforms one-shot averaging and ADMM in experiments
Abstract
We present a novel Newton-type method for distributed optimization, which is particularly well suited for stochastic optimization and learning problems. For quadratic objectives, the method enjoys a linear rate of convergence which provably \emph{improves} with the data size, requiring an essentially constant number of iterations under reasonable assumptions. We provide theoretical and empirical evidence of the advantages of our method compared to other approaches, such as one-shot parameter averaging and ADMM.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research
MethodsAlternating Direction Method of Multipliers
