Non-Convex Self-Concordant Functions: Practical Algorithms and Complexity Analysis
Donald Goldfarb, Lexiao Lai, Tianyi Lin, Jiayu Zhang

TL;DR
This paper extends self-concordance to non-convex functions, proposing second-order algorithms with convergence guarantees and demonstrating their efficiency in large-scale and neural network optimization.
Contribution
It introduces weakly and F-based self-concordant function classes and develops algorithms with global convergence guarantees for non-convex optimization.
Findings
Algorithms achieve $O(\epsilon^{-2})$ convergence to first-order stationary points.
Adaptive method attains second-order stationary points with negative curvature detection.
Experimental results show improved robustness and efficiency over existing methods.
Abstract
We extend the standard notion of self-concordance to non-convex optimization and develop a family of second-order algorithms with global convergence guarantees. In particular, two function classes -- \textit{weakly self-concordant} functions and \textit{-based self-concordant} functions -- generalize the self-concordant framework beyond convexity, without assuming the Lipschitz continuity of the gradient or Hessian. For these function classes, we propose a regularized Newton method and an adaptive regularization method that achieve an -approximate first-order stationary point in iterations. Equipped with an oracle capable of detecting negative curvature, the adaptive algorithm can further attain convergence to an approximate second-order stationary point. Our experimental results demonstrate that the proposed methods offer superior robustness and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
