Quasi-Newton Methods: Superlinear Convergence Without Line Searches for   Self-Concordant Functions

Wenbo Gao; Donald Goldfarb

arXiv:1612.06965·math.OC·August 13, 2018·Optim. Methods Softw.·2 cites

Quasi-Newton Methods: Superlinear Convergence Without Line Searches for Self-Concordant Functions

Wenbo Gao, Donald Goldfarb

PDF

Open Access

TL;DR

This paper introduces a curvature-adaptive step size for quasi-Newton methods, enabling superlinear convergence on self-concordant functions without line searches, and demonstrates its effectiveness through numerical experiments.

Contribution

It extends Nesterov's curvature-adaptive step size to quasi-Newton methods, achieving superlinear convergence without line searches on self-concordant functions.

Findings

01

Superlinear convergence achieved with BFGS using adaptive step size

02

Numerical experiments show improved performance over traditional methods

03

Adaptive step size simplifies implementation by removing line searches

Abstract

We consider the use of a curvature-adaptive step size in gradient-based iterative methods, including quasi-Newton methods, for minimizing self-concordant functions, extending an approach first proposed for Newton's method by Nesterov. This step size has a simple expression that can be computed analytically; hence, line searches are not needed. We show that using this step size in the BFGS method (and quasi-Newton methods in the Broyden convex class other than the DFP method) results in superlinear convergence for strongly convex self-concordant functions. We present numerical experiments comparing gradient descent and BFGS methods using the curvature-adaptive step size to traditional methods on deterministic logistic regression problems, and to versions of stochastic gradient descent on stochastic optimization problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research