On stochastic and deterministic quasi-Newton methods for non-Strongly   convex optimization: Asymptotic convergence and rate analysis

Farzad Yousefian; Angelia Nedi\'c; Uday Shanbhag

arXiv:1710.05509·math.OC·October 2, 2019·SIAM J. Optim.

On stochastic and deterministic quasi-Newton methods for non-Strongly convex optimization: Asymptotic convergence and rate analysis

Farzad Yousefian, Angelia Nedi\'c, Uday Shanbhag

PDF

TL;DR

This paper introduces new stochastic and deterministic quasi-Newton methods for non-strongly convex optimization, providing convergence guarantees and rates without the strong convexity assumption, and demonstrates their effectiveness on large-scale problems.

Contribution

Develops an iteratively regularized stochastic limited-memory BFGS algorithm with convergence and rate analysis for non-strongly convex problems, filling a gap in existing methods.

Findings

01

Convergence rate of stochastic method is approximately O(k^{-(1/3 - ε)}).

02

Deterministic method achieves a rate of about O(1/k^{1 - ε'}).

03

Numerical experiments confirm effectiveness on large-scale text classification.

Abstract

Motivated by applications arising from large scale optimization and machine learning, we consider stochastic quasi-Newton (SQN) methods for solving unconstrained convex optimization problems. The convergence analysis of the SQN methods, both full and limited-memory variants, require the objective function to be strongly convex. However, this assumption is fairly restrictive and does not hold for applications such as minimizing the logistic regression loss function. To the best of our knowledge, no rate statements currently exist for SQN methods in the absence of such an assumption. Also, among the existing first-order methods for addressing stochastic optimization problems with merely convex objectives, those equipped with provable convergence rates employ averaging. However, this averaging technique has a detrimental impact on inducing sparsity. Motivated by these gaps, the main…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.