Stochastic Second-Order Methods Improve Best-Known Sample Complexity of SGD for Gradient-Dominated Function
Saeed Masiha, Saber Salehkaleybar, Niao He, Negar Kiyavash, Patrick, Thiran

TL;DR
This paper demonstrates that Stochastic Cubic Regularized Newton (SCRN) methods significantly improve sample complexity over traditional stochastic gradient descent for gradient-dominated functions, with applications in machine learning and reinforcement learning.
Contribution
The paper introduces SCRN for gradient-dominated functions, providing improved sample complexity bounds and demonstrating effectiveness in reinforcement learning scenarios.
Findings
SCRN achieves better sample complexity than SGD for gradient-dominated functions.
SCRN's performance is validated through experiments in reinforcement learning.
Variance reduction further improves SCRN's efficiency for certain cases.
Abstract
We study the performance of Stochastic Cubic Regularized Newton (SCRN) on a class of functions satisfying gradient dominance property with which holds in a wide range of applications in machine learning and signal processing. This condition ensures that any first-order stationary point is a global optimum. We prove that the total sample complexity of SCRN in achieving -global optimum is for and for . SCRN improves the best-known sample complexity of stochastic gradient descent. Even under a weak version of gradient dominance property, which is applicable to policy-based reinforcement learning (RL), SCRN achieves the same improvement over stochastic policy gradient methods. Additionally, we show that the average sample complexity of SCRN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Machine Learning and ELM
