Inertial Newton Algorithms Avoiding Strict Saddle Points

Camille Castera

arXiv:2111.04596·math.OC·February 13, 2024

Inertial Newton Algorithms Avoiding Strict Saddle Points

Camille Castera

PDF

Open Access 1 Repo

TL;DR

This paper investigates second-order algorithms combining Newton's method and inertial gradient descent, demonstrating they typically avoid strict saddle points in non-convex optimization and highlighting the influence of hyper-parameters.

Contribution

It provides theoretical analysis showing these algorithms almost always escape strict saddle points, supported by numerical experiments, and explores hyper-parameter effects.

Findings

01

Algorithms avoid strict saddle points in non-convex landscapes.

02

Hyper-parameters significantly influence behavior near critical points.

03

Numerical illustrations support theoretical results.

Abstract

We study the asymptotic behavior of second-order algorithms mixing Newton's method and inertial gradient descent in non-convex landscapes. We show that, despite the Newtonian behavior of these methods, they almost always escape strict saddle points. We also evidence the role played by the hyper-parameters of these methods in their qualitative behavior near critical points. The theoretical results are supported by numerical illustrations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

camcastera/innaavoidssaddles
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research · Iterative Methods for Nonlinear Equations