Inexact Newton-CG Algorithms With Complexity Guarantees
Zhewei Yao, Peng Xu, Fred Roosta, Stephen J. Wright, Michael W., Mahoney

TL;DR
This paper introduces inexact Newton-CG algorithms for nonconvex optimization that use approximate gradient and Hessian estimates, achieving optimal iteration complexity bounds with high-probability guarantees and demonstrating empirical effectiveness.
Contribution
The paper develops new inexact Newton-CG algorithms with adaptive inexactness conditions and high-probability complexity guarantees for nonconvex problems, including practical variants with empirical validation.
Findings
Algorithms achieve optimal iteration complexity bounds.
Adaptive inexactness allows crude estimates in high-gradient regions.
Empirical results demonstrate effectiveness on machine learning models.
Abstract
We consider variants of a recently-developed Newton-CG algorithm for nonconvex problems \citep{royer2018newton} in which inexact estimates of the gradient and the Hessian information are used for various steps. Under certain conditions on the inexactness measures, we derive iteration complexity bounds for achieving -approximate second-order optimality that match best-known lower bounds. Our inexactness condition on the gradient is adaptive, allowing for crude accuracy in regions with large gradients. We describe two variants of our approach, one in which the step-size along the computed search direction is chosen adaptively and another in which the step-size is pre-defined. To obtain second-order optimality, our algorithms will make use of a negative curvature direction on some steps. These directions can be obtained, with high-probability, using a certain randomized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Sparse and Compressive Sensing Techniques
