Hyperfast Second-Order Local Solvers for Efficient Statistically Preconditioned Distributed Optimization
Pavel Dvurechensky, Dmitry Kamzolov, Aleksandr Lukashevich, Soomin, Lee, Erik Ordentlich, C\'esar A. Uribe, Alexander Gasnikov

TL;DR
This paper introduces an inexact accelerated method for distributed optimization that leverages high-order auxiliary problem solvers, significantly improving efficiency in large-scale empirical risk minimization tasks.
Contribution
It develops an inexact adaptive accelerated Bregman proximal gradient method and a hyperfast second-order solver for auxiliary problems, enabling efficient large-scale distributed optimization.
Findings
Achieved linear convergence with the hyperfast second-order method.
Demonstrated practical efficiency on large-scale logistic regression datasets.
Provided the first empirical results of high-order methods on large-scale problems.
Abstract
Statistical preconditioning enables fast methods for distributed large-scale empirical risk minimization problems. In this approach, multiple worker nodes compute gradients in parallel, which are then used by the central node to update the parameter by solving an auxiliary (preconditioned) smaller-scale optimization problem. The recently proposed Statistically Preconditioned Accelerated Gradient (SPAG) method has complexity bounds superior to other such algorithms but requires an exact solution for computationally intensive auxiliary optimization problems at every iteration. In this paper, we propose an Inexact SPAG (InSPAG) and explicitly characterize the accuracy by which the corresponding auxiliary subproblem needs to be solved to guarantee the same convergence rate as the exact method. We build our results by first developing an inexact adaptive accelerated Bregman proximal gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research
