Communication Efficient Distributed Optimization using an Approximate Newton-type Method

Ohad Shamir; Nathan Srebro; Tong Zhang

arXiv:1312.7853 · cs.LG · May 15, 2014 · 84 cites

TL;DR

This paper introduces a Newton-type method for distributed optimization that is particularly well suited to stochastic optimization and learning problems. For quadratic objectives, its linear convergence rate provably improves as the data size grows.

Contribution

It proposes a novel Newton-type algorithm for distributed stochastic optimization. For quadratic objectives, the algorithm has a provable linear convergence rate that improves with the data size.

Findings

01. Achieves a linear convergence rate for quadratic objectives

02. Requires an essentially constant number of iterations as the data size grows (for quadratic objectives, under reasonable assumptions)

03. Outperforms one-shot parameter averaging and ADMM in experiments

Abstract

We present a novel Newton-type method for distributed optimization, which is particularly well suited for stochastic optimization and learning problems. For quadratic objectives, the method enjoys a linear rate of convergence which provably improves with the data size, requiring an essentially constant number of iterations under reasonable assumptions. We provide theoretical and empirical evidence of the advantages of our method compared to other approaches, such as one-shot parameter averaging and ADMM.
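To make the comparison in the abstract concrete, here is a minimal sketch on a toy problem. It is an illustration under stated assumptions, not the paper's algorithm as published. Assume each of m machines holds a scalar quadratic phi_i(w) = 0.5*a_i*w^2 - b_i*w, and the goal is to minimize their average. The function names, the step size `eta`, and the regularizer `mu` are all hypothetical choices made for this example. One-shot averaging lets every machine solve its own problem once and averages the answers. The Newton-type iteration instead averages the gradients, lets each machine rescale that global gradient by its own inverse curvature, and averages the resulting steps.

```python
from statistics import mean


def global_minimizer(a, b):
    """Exact minimizer of the average of phi_i(w) = 0.5*a_i*w**2 - b_i*w."""
    return mean(b) / mean(a)


def one_shot_average(a, b):
    """Each machine minimizes its own phi_i once; the minimizers are averaged."""
    return mean(bi / ai for ai, bi in zip(a, b))


def approx_newton(a, b, iters=5, eta=1.0, mu=0.0, w0=0.0):
    """Approximate Newton-type iteration (illustrative sketch only).

    Each iteration uses two averaging rounds:
      1. average the local gradients to get the global gradient;
      2. each machine scales that gradient by 1 / (a_i + mu), its own
         regularized inverse curvature, and the scaled steps are averaged.
    """
    w = w0
    for _ in range(iters):
        grad = mean(ai * w - bi for ai, bi in zip(a, b))  # round 1
        step = mean(grad / (ai + mu) for ai in a)         # round 2
        w -= eta * step
    return w


if __name__ == "__main__":
    # Hypothetical local curvatures and linear terms for four machines.
    a = [0.9, 1.0, 1.1, 1.0]
    b = [1.0, 2.0, 3.0, 2.0]
    w_star = global_minimizer(a, b)
    print("one-shot error:", abs(one_shot_average(a, b) - w_star))
    for t in (1, 3, 5):
        print(f"newton-type error after {t} iters:",
              abs(approx_newton(a, b, iters=t) - w_star))
```

In this toy setting with `eta=1` and `mu=0`, each iteration multiplies the error by 1 - mean(a) * mean(1/a_i), which approaches zero as the local curvatures a_i become more alike. That mirrors, in a simplified way, the abstract's claim that the rate improves with data size: more data per machine makes the local objectives look more like the global one.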

Code & Models

Repositories

DAve-QN/source

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research

Methods: Alternating Direction Method of Multipliers