A Distributed Quasi-Newton Algorithm for Primal and Dual Regularized   Empirical Risk Minimization

Ching-pei Lee; Cong Han Lim; Stephen J. Wright

arXiv:1912.06508·cs.LG·December 16, 2019·1 cites

A Distributed Quasi-Newton Algorithm for Primal and Dual Regularized Empirical Risk Minimization

Ching-pei Lee, Cong Han Lim, Stephen J. Wright

PDF

Open Access 1 Repo

TL;DR

This paper introduces a distributed second-order optimization algorithm for empirical risk minimization that efficiently uses curvature information, improving convergence speed and reducing communication costs in both primal and dual settings.

Contribution

It presents a novel distributed quasi-Newton method that leverages global Hessian approximations, outperforming existing methods especially in dual ERM problems.

Findings

01

Significantly reduces communication costs compared to state-of-the-art methods.

02

Achieves global linear convergence for a wide range of ERM problems.

03

Demonstrates faster convergence and lower runtime in computational experiments.

Abstract

We propose a communication- and computation-efficient distributed optimization algorithm using second-order information for solving empirical risk minimization (ERM) problems with a nonsmooth regularization term. Our algorithm is applicable to both the primal and the dual ERM problem. Current second-order and quasi-Newton methods for this problem either do not work well in the distributed setting or work only for specific regularizers. Our algorithm uses successive quadratic approximations of the smooth part, and we describe how to maintain an approximation of the (generalized) Hessian and solve subproblems efficiently in a distributed manner. When applied to the distributed dual ERM problem, unlike state of the art that takes only the block-diagonal part of the Hessian, our approach is able to utilize global curvature information and is thus magnitudes faster. The proposed method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

leepei/dplbfgs
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research