A Newton-Based Method for Nonconvex Optimization with Fast Evasion of Saddle Points
Santiago Paternain, Aryan Mokhtari, Alejandro Ribeiro

TL;DR
This paper introduces a modified Newton-based method that efficiently escapes saddle points in nonconvex optimization problems, ensuring faster convergence to local minima in machine learning tasks.
Contribution
It proposes a novel second-order algorithm that replaces negative Hessian eigenvalues with their absolute values, enabling rapid saddle point escape.
Findings
Escapes saddle points in at most logarithmic iterations
Converges to a local minimum with high probability
Operates independently of problem-specific constants
Abstract
Machine learning problems such as neural network training, tensor decomposition, and matrix factorization, require local minimization of a nonconvex function. This local minimization is challenged by the presence of saddle points, of which there can be many and from which descent methods may take inordinately large number of iterations to escape. This paper presents a second-order method that modifies the update of Newton's method by replacing the negative eigenvalues of the Hessian by their absolute values and uses a truncated version of the resulting matrix to account for the objective's curvature. The method is shown to escape saddles in at most iterations where is the target optimality and characterizes a point sufficiently far away from the saddle. This base of this exponential escape is independently of problem…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
