Faster stochastic cubic regularized Newton methods with momentum
Yiming Yang, Chuan He, Xiao Wang, Zheng Peng

TL;DR
This paper introduces faster stochastic cubic regularized Newton methods that incorporate momentum-based variance reduction for Hessian estimation, achieving improved iteration complexity and performance in large-scale optimization.
Contribution
The paper proposes novel SCRN algorithms with momentum-based Hessian estimation and analyzes their iteration complexity under mild assumptions.
Findings
Achieve comparable performance to deterministic CRN methods.
Significantly outperform first-order methods in iteration counts.
Effective for problems with accurate gradient estimation and costly Hessian computation.
Abstract
Cubic regularized Newton (CRN) methods have attracted signiffcant research interest because they offer stronger solution guarantees and lower iteration complexity. With the rise of the big-data era, there is growing interest in developing stochastic cubic regularized Newton (SCRN) methods that do not require exact gradient and Hessian evaluations. In this paper, we propose faster SCRN methods that incorporate gradient estimation with small, controlled errors and Hessian estimation with momentum-based variance reduction. These methods are particularly effective for problems where the gradient can be estimated accurately and at low cost, whereas accurate estimation of the Hessian is expensive. Under mild assumptions, we establish the iteration complexity of our SCRN methods by analyzing the descent of a novel potential sequence. Finally, numerical experiments show that our SCRN methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Iterative Methods for Nonlinear Equations · Sparse and Compressive Sensing Techniques
