Nys-Newton: Nystr\"om-Approximated Curvature for Stochastic Optimization
Dinesh Singh, Hardik Tankaria, Makoto Yamada

TL;DR
This paper introduces Nys-Newton, a stochastic optimization method that uses Nyström approximation of the Hessian to improve second-order optimization for large-scale convex and non-convex problems, achieving competitive performance.
Contribution
It proposes a novel Nyström-based Hessian approximation integrated with stochastic gradient methods, reducing computational complexity while maintaining effective curvature information.
Findings
Achieves better Hessian approximation than traditional methods.
Performs competitively with state-of-the-art first-order and quasi-Newton methods.
Provides theoretical convergence analysis for convex functions.
Abstract
Second-order optimization methods are among the most widely used optimization approaches for convex optimization problems, and have recently been used to optimize non-convex optimization problems such as deep learning models. The widely used second-order optimization methods such as quasi-Newton methods generally provide curvature information by approximating the Hessian using the secant equation. However, the secant equation becomes insipid in approximating the Newton step owing to its use of the first-order derivatives. In this study, we propose an approximate Newton sketch-based stochastic optimization algorithm for large-scale empirical risk minimization. Specifically, we compute a partial column Hessian of size () with randomly selected variables, then use the \emph{Nystr\"om method} to better approximate the full Hessian matrix. To further reduce the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research
MethodsLogistic Regression
