A Single-Loop Stochastic Proximal Quasi-Newton Method for Large-Scale   Nonsmooth Convex Optimization

Yongcun Song; Zimeng Wang; Xiaoming Yuan; Hangrui Yue

arXiv:2409.16971·math.OC·December 24, 2024

A Single-Loop Stochastic Proximal Quasi-Newton Method for Large-Scale Nonsmooth Convex Optimization

Yongcun Song, Zimeng Wang, Xiaoming Yuan, Hangrui Yue

PDF

Open Access

TL;DR

This paper introduces a novel single-loop stochastic proximal quasi-Newton method that combines variance reduction techniques with efficient Hessian approximation, achieving linear convergence for large-scale nonsmooth convex optimization.

Contribution

It develops a new stochastic proximal quasi-Newton algorithm integrating L-SVRG with L-BFGS, providing convergence guarantees and efficient subproblem solving for large-scale problems.

Findings

01

Proves global linear convergence under mild conditions.

02

Demonstrates the method's efficiency on regularized logistic regression.

03

Shows the method's flexibility with variance reduction techniques.

Abstract

We propose a new stochastic proximal quasi-Newton method for minimizing the sum of two convex functions in the particular context that one of the functions is the average of a large number of smooth functions and the other one is nonsmooth. The new method integrates a simple single-loop SVRG (L-SVRG) technique for sampling the gradient and a stochastic limited-memory BFGS (L-BFGS) scheme for approximating the Hessian of the smooth function components. The globally linear convergence rate of the new method is proved under mild assumptions. It is also shown that the new method covers a proximal variant of the L-SVRG as a special case, and it allows for various generalizations through the integration with other variance reduction methods. For example, the L-SVRG can be replaced with the SAGA or SEGA in the proposed new method and thus other new stochastic proximal quasi-Newton methods with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research