Global Convergence of Online Limited Memory BFGS

Aryan Mokhtari; Alejandro Ribeiro

arXiv:1409.2045·math.OC·September 9, 2014·132 cites

Global Convergence of Online Limited Memory BFGS

Aryan Mokhtari, Alejandro Ribeiro

PDF

Open Access

TL;DR

This paper proves the global convergence of an online limited memory BFGS algorithm for stochastic optimization in large-scale machine learning, demonstrating theoretical guarantees and practical efficiency improvements over existing methods.

Contribution

It introduces a new online limited memory BFGS method with proven convergence guarantees for stochastic objectives in large-scale learning tasks.

Findings

01

Convergence to optimal solutions with probability 1.

02

Reduced convergence time compared to stochastic gradient descent.

03

Lower storage and computational requirements than other online quasi-Newton methods.

Abstract

Global convergence of an online (stochastic) limited memory version of the Broyden-Fletcher- Goldfarb-Shanno (BFGS) quasi-Newton method for solving optimization problems with stochastic objectives that arise in large scale machine learning is established. Lower and upper bounds on the Hessian eigenvalues of the sample functions are shown to suffice to guarantee that the curvature approximation matrices have bounded determinants and traces, which, in turn, permits establishing convergence to optimal arguments with probability 1. Numerical experiments on support vector machines with synthetic data showcase reductions in convergence time relative to stochastic gradient descent algorithms as well as reductions in storage and computation relative to other online quasi-Newton methods. Experimental evaluation on a search engine advertising problem corroborates that these advantages also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research