Global Convergence of Online Limited Memory BFGS
Aryan Mokhtari, Alejandro Ribeiro

TL;DR
This paper proves the global convergence of an online limited memory BFGS algorithm for stochastic optimization in large-scale machine learning, demonstrating theoretical guarantees and practical efficiency improvements over existing methods.
Contribution
It introduces a new online limited memory BFGS method with proven convergence guarantees for stochastic objectives in large-scale learning tasks.
Findings
Convergence to optimal solutions with probability 1.
Reduced convergence time compared to stochastic gradient descent.
Lower storage and computational requirements than other online quasi-Newton methods.
Abstract
Global convergence of an online (stochastic) limited memory version of the Broyden-Fletcher- Goldfarb-Shanno (BFGS) quasi-Newton method for solving optimization problems with stochastic objectives that arise in large scale machine learning is established. Lower and upper bounds on the Hessian eigenvalues of the sample functions are shown to suffice to guarantee that the curvature approximation matrices have bounded determinants and traces, which, in turn, permits establishing convergence to optimal arguments with probability 1. Numerical experiments on support vector machines with synthetic data showcase reductions in convergence time relative to stochastic gradient descent algorithms as well as reductions in storage and computation relative to other online quasi-Newton methods. Experimental evaluation on a search engine advertising problem corroborates that these advantages also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research
