Quasi-Newton method of Optimization is proved to be a steepest descent   method under the ellipsoid norm

Jiongcheng Li

arXiv:2411.11286·math.OC·November 19, 2024

Quasi-Newton method of Optimization is proved to be a steepest descent method under the ellipsoid norm

Jiongcheng Li

PDF

Open Access

TL;DR

This paper proves that Quasi-Newton methods can be viewed as steepest descent methods when measured with an ellipsoid norm, providing new theoretical insights into their convergence properties.

Contribution

The paper demonstrates that Quasi-Newton methods are equivalent to steepest descent methods under an ellipsoid norm, generalizing classical inequalities and deepening theoretical understanding.

Findings

01

Quasi-Newton methods are steepest descent under ellipsoid norm

02

Introduction of generalized Cauchy-Schwartz inequalities

03

Theoretical proof of equivalence between Quasi-Newton and steepest descent

Abstract

Optimization problems, arise in many practical applications, from the view points of both theory and numerical methods. Especially, significant improvement in deep learning training came from the Quasi-Newton methods. Quasi-Newton search directions provide an attractive alternative to Newton's method in that they do not require computation of the Hessian and yet still attain a super linear rate of convergence. In Quasi-Newton method, we require Hessian approximation to satisfy the secant equation. In this paper, the Classical Cauchy-Schwartz Inequality is introduced, then more generalization are proposed. And it is seriously proved that Quasi-Newton method is a steepest descent method under the ellipsoid norm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIterative Methods for Nonlinear Equations · Advanced Optimization Algorithms Research · Aerospace Engineering and Control Systems