A Distributed Quasi-Newton Algorithm for Empirical Risk Minimization with Nonsmooth Regularization

Ching-pei Lee; Cong Han Lim; Stephen J. Wright

arXiv:1803.01370·math.OC·May 29, 2018

TL;DR

This paper introduces a distributed quasi-Newton algorithm for empirical risk minimization (ERM) problems with nonsmooth regularization. It is designed to be efficient in both communication and computation, and it converges globally at a linear rate for a broad range of non-strongly convex problems.

Contribution

It presents a distributed optimization method that uses second-order information, handles nonsmooth regularizers rather than only specific ones, and converges globally for a wide class of ERM problems.

Findings

01 Significantly reduces communication costs compared to existing methods

02 Achieves global linear convergence for a broad range of non-strongly convex problems, including the most commonly used ERMs

03 Demonstrates improved running time in initial experiments on convex problems

Abstract

We propose a communication- and computation-efficient distributed optimization algorithm using second-order information for solving empirical risk minimization (ERM) problems with a nonsmooth regularization term. Current second-order and quasi-Newton methods for this problem either do not work well in the distributed setting or work only for specific regularizers. Our algorithm uses successive quadratic approximations, and we describe how to maintain an approximation of the Hessian and solve subproblems efficiently in a distributed manner. The proposed method enjoys global linear convergence for a broad range of non-strongly convex problems that includes the most commonly used ERMs, thus requiring lower communication complexity. It also converges on non-convex problems, so it has the potential to be used in applications such as deep learning. Initial computational results on convex problems demonstrate that our method…
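The idea of successive quadratic approximation mentioned in the abstract can be illustrated with a minimal single-machine sketch. This is not the paper's algorithm: it assumes an L1-regularized least-squares objective, replaces the quasi-Newton Hessian approximation with a scalar multiple of the identity (updated by a Barzilai-Borwein-style ratio), and does no distributed computation. All function names here are hypothetical. With this simplified Hessian, each quadratic subproblem has a closed-form solution given by soft-thresholding.

```python
def soft_threshold(z, t):
    """Proximal operator of t * |.| applied to a scalar."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def residuals(X, y, w):
    """Residual vector X w - y for a dense row-major matrix X."""
    return [sum(xi * wi for xi, wi in zip(row, w)) - yi for row, yi in zip(X, y)]


def objective(X, y, w, lam):
    """0.5 * ||X w - y||^2 + lam * ||w||_1."""
    r = residuals(X, y, w)
    return 0.5 * sum(ri * ri for ri in r) + lam * sum(abs(wi) for wi in w)


def gradient(X, y, w):
    """Gradient of the smooth least-squares term, X^T (X w - y)."""
    r = residuals(X, y, w)
    return [sum(row[j] * ri for row, ri in zip(X, r)) for j in range(len(w))]


def sqa_l1_least_squares(X, y, lam, iters=200):
    """Successive quadratic approximation with H = gamma * I (assumed simplification)."""
    w = [0.0] * len(X[0])
    gamma = 1.0  # scalar stand-in for a Hessian approximation
    for _ in range(iters):
        g = gradient(X, y, w)
        f_old = objective(X, y, w, lam)
        while True:
            # Exact minimizer over d of g^T d + (gamma/2)||d||^2 + lam * ||w + d||_1.
            w_new = [soft_threshold(wi - gi / gamma, lam / gamma)
                     for wi, gi in zip(w, g)]
            d = [a - b for a, b in zip(w_new, w)]
            dd = sum(di * di for di in d)
            if dd == 0.0:
                return w  # zero step: w is a fixed point of the update
            # Accept the step only if the objective decreases sufficiently.
            if objective(X, y, w_new, lam) <= f_old - 1e-4 * 0.5 * gamma * dd:
                break
            gamma *= 2.0  # model was too optimistic; increase curvature
        # Barzilai-Borwein-style curvature estimate s^T y / s^T s, clamped.
        g_new = gradient(X, y, w_new)
        sy = sum(di * (gn - go) for di, gn, go in zip(d, g_new, g))
        gamma = min(max(sy / dd, 1e-8), 1e8)
        w = w_new
    return w
```

For example, with `X` the 2×2 identity, `y = [3.0, 0.5]`, and `lam = 1.0`, the minimizer is the soft-thresholded `y`, so `sqa_l1_least_squares(X, y, 1.0)` returns `[2.0, 0.0]`. A richer Hessian approximation, such as the L-BFGS-type one the paper's title suggests, would generally require an iterative subproblem solver instead of this closed form.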

Code & Models

Repositories

leepei/dplbfgs (Official)