Inexact Non-Convex Newton-Type Methods
Zhewei Yao, Peng Xu, Farbod Roosta-Khorasani, Michael W. Mahoney

TL;DR
This paper introduces inexact non-convex Newton-type methods that efficiently solve large-scale problems by using approximations for gradients and Hessians, maintaining optimal iteration complexity without needing problem-specific parameters.
Contribution
It develops inexact trust region and cubic regularization methods with mild approximation conditions, extending their applicability to large-scale problems with practical, implementable algorithms.
Findings
Achieve similar iteration complexity as exact methods
Effective in large-scale finite-sum problems
Empirical results show promising performance on real datasets
Abstract
For solving large-scale non-convex problems, we propose inexact variants of trust region and adaptive cubic regularization methods, which, to increase efficiency, incorporate various approximations. In particular, in addition to approximate sub-problem solves, both the Hessian and the gradient are suitably approximated. Using rather mild conditions on such approximations, we show that our proposed inexact methods achieve similar optimal worst-case iteration complexities as the exact counterparts. Our proposed algorithms, and their respective theoretical analysis, do not require knowledge of any unknowable problem-related quantities, and hence are easily implementable in practice. In the context of finite-sum problems, we then explore randomized sub-sampling methods as ways to construct the gradient and Hessian approximations and examine the empirical performance of our algorithms on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms
