Explicit Convergence Rates of Greedy and Random Quasi-Newton Methods

Dachao Lin; Haishan Ye; Zhihua Zhang

arXiv:2104.08764·math.OC·September 13, 2022·J. Mach. Learn. Res.·5 cites

Explicit Convergence Rates of Greedy and Random Quasi-Newton Methods

Dachao Lin, Haishan Ye, Zhihua Zhang

PDF

Open Access

TL;DR

This paper establishes explicit superlinear convergence rates for both greedy and random quasi-Newton methods, including BFGS and SR1, extending previous results and improving convergence guarantees for these optimization algorithms.

Contribution

It extends convergence rate results to random quasi-Newton methods and provides improved superlinear convergence guarantees for BFGS and SR1 methods.

Findings

01

Random quasi-Newton methods have explicit superlinear convergence rates.

02

Improved convergence rates for BFGS and SR1 methods.

03

Analysis applies to strongly convex, smooth, and self-concordant functions.

Abstract

Optimization is important in machine learning problems, and quasi-Newton methods have a reputation as the most efficient numerical schemes for smooth unconstrained optimization. In this paper, we consider the explicit superlinear convergence rates of quasi-Newton methods and address two open problems mentioned by Rodomanov and Nesterov. First, we extend Rodomanov and Nesterov's results to random quasi-Newton methods, which include common DFP, BFGS, SR1 methods. Such random methods adopt a random direction for updating the approximate Hessian matrix in each iteration. Second, we focus on the specific quasi-Newton methods: SR1 and BFGS methods. We provide improved versions of greedy and random methods with provable better explicit (local) superlinear convergence rates. Our analysis is closely related to the approximation of a given Hessian matrix, unconstrained quadratic objective, as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research