Analysis of Limited-Memory BFGS on a Class of Nonsmooth Convex Functions
Azam Asl, Michael L. Overton

TL;DR
This paper investigates the behavior of scaled memoryless BFGS on a class of nonsmooth convex functions, revealing conditions under which it converges to non-optimal points and comparing its performance to gradient methods.
Contribution
The study provides a detailed analysis of scaled memoryless BFGS on nonsmooth functions, showing its convergence properties and failure conditions, which were previously not well understood.
Findings
Scaled memoryless BFGS converges to non-optimal points under certain conditions.
It fails on specific nonsmooth functions when a parameter exceeds a threshold.
Numerical experiments suggest similar behavior for scaled L-BFGS with multiple updates.
Abstract
The limited memory BFGS (L-BFGS) method is widely used for large-scale unconstrained optimization, but its behavior on nonsmooth problems has received little attention. L-BFGS can be used with or without "scaling"; the use of scaling is normally recommended. A simple special case, when just one BFGS update is stored and used at every iteration, is sometimes also known as memoryless BFGS. We analyze memoryless BFGS with scaling, using any Armijo-Wolfe line search, on the function , initiated at any point with . We show that if , the absolute value of the normalized search direction generated by this method converges to a constant vector, and if, in addition, is larger than a quantity that depends on the Armijo parameter, then the iterates converge to a non-optimal point with $\bar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
