On stochastic and deterministic quasi-Newton methods for non-Strongly convex optimization: Asymptotic convergence and rate analysis
Farzad Yousefian, Angelia Nedi\'c, Uday Shanbhag

TL;DR
This paper introduces new stochastic and deterministic quasi-Newton methods for non-strongly convex optimization, providing convergence guarantees and rates without the strong convexity assumption, and demonstrates their effectiveness on large-scale problems.
Contribution
Develops an iteratively regularized stochastic limited-memory BFGS algorithm with convergence and rate analysis for non-strongly convex problems, filling a gap in existing methods.
Findings
Convergence rate of stochastic method is approximately O(k^{-(1/3 - ε)}).
Deterministic method achieves a rate of about O(1/k^{1 - ε'}).
Numerical experiments confirm effectiveness on large-scale text classification.
Abstract
Motivated by applications arising from large scale optimization and machine learning, we consider stochastic quasi-Newton (SQN) methods for solving unconstrained convex optimization problems. The convergence analysis of the SQN methods, both full and limited-memory variants, require the objective function to be strongly convex. However, this assumption is fairly restrictive and does not hold for applications such as minimizing the logistic regression loss function. To the best of our knowledge, no rate statements currently exist for SQN methods in the absence of such an assumption. Also, among the existing first-order methods for addressing stochastic optimization problems with merely convex objectives, those equipped with provable convergence rates employ averaging. However, this averaging technique has a detrimental impact on inducing sparsity. Motivated by these gaps, the main…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
