Hyperfast Second-Order Local Solvers for Efficient Statistically Preconditioned Distributed Optimization

Pavel Dvurechensky; Dmitry Kamzolov; Aleksandr Lukashevich; Soomin Lee; Erik Ordentlich; César A. Uribe; Alexander Gasnikov

arXiv:2102.08246 · math.OC · October 5, 2022 · EURO J. Comput. Optim.

TL;DR

This paper introduces an inexact accelerated method for distributed optimization (InSPAG) that solves the auxiliary subproblem at each iteration only approximately, using high-order solvers, while keeping the convergence rate of the exact method on large-scale empirical risk minimization tasks.

Contribution

It develops an inexact adaptive accelerated Bregman proximal gradient method and a hyperfast second-order solver for auxiliary problems, enabling efficient large-scale distributed optimization.

Findings

01. Achieved linear convergence with the hyperfast second-order method.

02. Demonstrated practical efficiency on large-scale logistic regression datasets.

03. Provided the first empirical results of high-order methods on large-scale problems.

Abstract

Statistical preconditioning enables fast methods for distributed large-scale empirical risk minimization problems. In this approach, multiple worker nodes compute gradients in parallel, which are then used by the central node to update the parameter by solving an auxiliary (preconditioned) smaller-scale optimization problem. The recently proposed Statistically Preconditioned Accelerated Gradient (SPAG) method has complexity bounds superior to other such algorithms but requires an exact solution for computationally intensive auxiliary optimization problems at every iteration. In this paper, we propose an Inexact SPAG (InSPAG) and explicitly characterize the accuracy by which the corresponding auxiliary subproblem needs to be solved to guarantee the same convergence rate as the exact method. We build our results by first developing an inexact adaptive accelerated Bregman proximal gradient…
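
The scheme described in the abstract (workers compute gradients in parallel, a central node solves a preconditioned auxiliary problem, and that problem is solved only approximately) can be illustrated with a small sketch. This is not the InSPAG algorithm from the paper: it has no acceleration or adaptivity, and it uses plain Newton steps rather than the hyperfast second-order solver. It assumes a regularized logistic regression objective, a reference function equal to the server's local loss plus (mu/2)||w||^2, and a relative smoothness constant of 1. All function names, parameters, and the synthetic data are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def logistic_loss(w, X, y):
    # Mean logistic loss for labels y in {-1, +1}.
    return np.mean(np.logaddexp(0.0, -y * (X @ w)))

def logistic_grad(w, X, y):
    s = sigmoid(-y * (X @ w))
    return -(X.T @ (y * s)) / len(y)

def logistic_hess(w, X, y):
    s = sigmoid(y * (X @ w))
    return (X.T * (s * (1.0 - s))) @ X / len(y)

def preconditioned_step(w_k, g, X_s, y_s, mu, tol=1e-8, max_newton=20):
    """Inexactly minimize V(w) = <g, w> + D_phi(w, w_k), where
    phi(w) = server_loss(w) + (mu / 2) * ||w||^2 (assumed reference function)
    and D_phi is its Bregman divergence. Newton steps stop once the
    gradient norm of V drops below tol (the inexactness criterion here)."""
    grad_phi_k = logistic_grad(w_k, X_s, y_s) + mu * w_k
    w = w_k.copy()
    for _ in range(max_newton):
        grad_V = g + logistic_grad(w, X_s, y_s) + mu * w - grad_phi_k
        if np.linalg.norm(grad_V) <= tol:
            break
        hess_V = logistic_hess(w, X_s, y_s) + mu * np.eye(len(w))
        w = w - np.linalg.solve(hess_V, grad_V)
    return w

def run_demo(n_workers=4, n_per_worker=500, d=5, lam=0.01, mu=0.3,
             iters=60, seed=0):
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((n_workers * n_per_worker, d))
    X /= np.linalg.norm(X, axis=1, keepdims=True)  # unit-norm rows
    w_true = 3.0 * rng.standard_normal(d)
    y = np.where(rng.random(len(X)) < sigmoid(X @ w_true), 1.0, -1.0)

    # Equal-size shards, one per worker; the server reuses shard 0
    # as its local sample for the preconditioner.
    shards = list(zip(np.array_split(X, n_workers),
                      np.array_split(y, n_workers)))
    X_s, y_s = shards[0]

    def full_loss(w):
        return logistic_loss(w, X, y) + 0.5 * lam * (w @ w)

    def full_grad(w):
        # Average of the worker gradients (computed in parallel in a
        # real system), plus the regularizer gradient.
        worker_grads = [logistic_grad(w, Xi, yi) for Xi, yi in shards]
        return np.mean(worker_grads, axis=0) + lam * w

    w = np.zeros(d)
    loss_start = full_loss(w)
    grad_start = np.linalg.norm(full_grad(w))
    for _ in range(iters):
        w = preconditioned_step(w, full_grad(w), X_s, y_s, mu)
    return loss_start, full_loss(w), grad_start, np.linalg.norm(full_grad(w))

if __name__ == "__main__":
    l0, l1, g0, g1 = run_demo()
    print(f"loss: {l0:.4f} -> {l1:.4f}, grad norm: {g0:.2e} -> {g1:.2e}")
```

In this sketch, `tol` and `max_newton` control how accurately the auxiliary problem is solved. The value `mu=0.3` is a conservative choice for unit-norm features, where the logistic Hessian is bounded by 0.25; a smaller `mu` would rely more heavily on the server sample resembling the full data set.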

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research