Sub-Sampled Newton Methods I: Globally Convergent Algorithms
Farbod Roosta-Khorasani, Michael W. Mahoney

TL;DR
This paper introduces globally convergent sub-sampled Newton algorithms for large-scale optimization, providing non-asymptotic bounds and analyzing the effects of Hessian and gradient sub-sampling with regularization.
Contribution
It develops new second-order optimization algorithms with convergence guarantees using sub-sampling techniques and randomized linear algebra, applicable to large-scale problems.
Findings
Algorithms are globally convergent from any initial point.
Sub-sampling Hessian and gradient reduces computational complexity.
Convergence bounds are non-asymptotic and quantitative.
Abstract
Large scale optimization problems are ubiquitous in machine learning and data analysis and there is a plethora of algorithms for solving such problems. Many of these algorithms employ sub-sampling, as a way to either speed up the computations and/or to implicitly implement a form of statistical regularization. In this paper, we consider second-order iterative optimization algorithms and we provide bounds on the convergence of the variants of Newton's method that incorporate uniform sub-sampling as a means to estimate the gradient and/or Hessian. Our bounds are non-asymptotic and quantitative. Our algorithms are global and are guaranteed to converge from any initial iterate. Using random matrix concentration inequalities, one can sub-sample the Hessian to preserve the curvature information. Our first algorithm incorporates Hessian sub-sampling while using the full gradient. We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
