Nesterov's Acceleration For Approximate Newton
Haishan Ye, Zhihua Zhang

TL;DR
This paper applies Nesterov's acceleration to approximate Newton methods in optimization, improving convergence and practical performance in machine learning tasks, especially when Hessian approximation is challenging.
Contribution
It introduces an accelerated regularized sub-sampled Newton method that leverages Nesterov's acceleration to enhance convergence and empirical performance.
Findings
Accelerated method outperforms original regularized sub-sampled Newton in experiments.
Theoretical analysis confirms acceleration improves convergence for approximate Newton.
Performance is comparable or superior to classical algorithms.
Abstract
Optimization plays a key role in machine learning. Recently, stochastic second-order methods have attracted much attention due to their low computational cost in each iteration. However, these algorithms might perform poorly especially if it is hard to approximate the Hessian well and efficiently. As far as we know, there is no effective way to handle this problem. In this paper, we resort to Nesterov's acceleration technique to improve the convergence performance of a class of second-order methods called approximate Newton. We give a theoretical analysis that Nesterov's acceleration technique can improve the convergence performance for approximate Newton just like for first-order methods. We accordingly propose an accelerated regularized sub-sampled Newton. Our accelerated algorithm performs much better than the original regularized sub-sampled Newton in experiments, which validates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Matrix Theory and Algorithms · Advanced Optimization Algorithms Research
