Inexact Non-Convex Newton-Type Methods

Zhewei Yao; Peng Xu; Farbod Roosta-Khorasani; Michael W. Mahoney

arXiv:1802.06925·math.OC·February 21, 2018·INFORMS J. Optim.·29 cites

Inexact Non-Convex Newton-Type Methods

Zhewei Yao, Peng Xu, Farbod Roosta-Khorasani, Michael W. Mahoney

PDF

Open Access

TL;DR

This paper introduces inexact non-convex Newton-type methods that efficiently solve large-scale problems by using approximations for gradients and Hessians, maintaining optimal iteration complexity without needing problem-specific parameters.

Contribution

It develops inexact trust region and cubic regularization methods with mild approximation conditions, extending their applicability to large-scale problems with practical, implementable algorithms.

Findings

01

Achieve similar iteration complexity as exact methods

02

Effective in large-scale finite-sum problems

03

Empirical results show promising performance on real datasets

Abstract

For solving large-scale non-convex problems, we propose inexact variants of trust region and adaptive cubic regularization methods, which, to increase efficiency, incorporate various approximations. In particular, in addition to approximate sub-problem solves, both the Hessian and the gradient are suitably approximated. Using rather mild conditions on such approximations, we show that our proposed inexact methods achieve similar optimal worst-case iteration complexities as the exact counterparts. Our proposed algorithms, and their respective theoretical analysis, do not require knowledge of any unknowable problem-related quantities, and hence are easily implementable in practice. In the context of finite-sum problems, we then explore randomized sub-sampling methods as ways to construct the gradient and Hessian approximations and examine the empirical performance of our algorithms on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms