A Dense Initialization for Limited-Memory Quasi-Newton Methods
Johannes Brust, Oleg Burdakov, Jennifer B. Erway, and Roummel F., Marcia

TL;DR
This paper introduces a novel dense eigendecomposition-based initialization for limited-memory quasi-Newton methods, improving performance in nonconvex optimization by leveraging a shape-changing trust-region approach.
Contribution
It proposes a new dense initialization scheme for L-BFGS that enhances optimization efficiency without explicitly forming matrices, applicable to various quasi-Newton methods.
Findings
Outperforms traditional diagonal initialization in numerical tests
Effective in nonconvex unconstrained optimization problems
Broad applicability to quasi-Newton trust-region and line search methods
Abstract
We consider a family of dense initializations for limited-memory quasi-Newton methods. The proposed initialization exploits an eigendecomposition-based separation of the full space into two complementary subspaces, assigning a different initialization parameter to each subspace. This family of dense initializations is proposed in the context of a limited-memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) trust-region method that makes use of a shape-changing norm to define each subproblem. As with L-BFGS methods that traditionally use diagonal initialization, the dense initialization and the sequence of generated quasi-Newton matrices are never explicitly formed. Numerical experiments on the CUTEst test set suggest that this initialization together with the shape-changing trust-region method outperforms other L-BFGS methods for solving general nonconvex unconstrained optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Optimization and Variational Analysis · Matrix Theory and Algorithms
