Second-order optimization with lazy Hessians

Nikita Doikov; El Mahdi Chayti; Martin Jaggi

arXiv:2212.00781·math.OC·June 16, 2023·1 cites

Second-order optimization with lazy Hessians

Nikita Doikov, El Mahdi Chayti, Martin Jaggi

PDF

Open Access 1 Video

TL;DR

This paper introduces a lazy Hessian update strategy for Newton's method that reduces computational complexity while maintaining fast convergence to second-order stationary points in non-convex optimization.

Contribution

It proposes reusing Hessians over multiple iterations, establishing convergence guarantees, and identifying optimal update frequency to improve efficiency in second-order methods.

Findings

01

Lazy Hessian updates reduce computational complexity.

02

The method converges to second-order stationary points with fast rates.

03

Optimal Hessian update frequency is every d iterations, improving complexity by √d.

Abstract

We analyze Newton's method with lazy Hessian updates for solving general possibly non-convex optimization problems. We propose to reuse a previously seen Hessian for several iterations while computing new gradients at each step of the method. This significantly reduces the overall arithmetical complexity of second-order optimization schemes. By using the cubic regularization technique, we establish fast global convergence of our method to a second-order stationary point, while the Hessian does not need to be updated each iteration. For convex problems, we justify global and local superlinear rates for lazy Newton steps with quadratic regularization, which is easier to compute. The optimal frequency for updating the Hessian is once every $d$ iterations, where $d$ is the dimension of the problem. This provably improves the total arithmetical complexity of second-order algorithms by a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Second-Order Optimization with Lazy Hessians· slideslive

Taxonomy

TopicsAdvanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques