Stochastic Newton Proximal Extragradient Method
Ruichen Jiang, Micha{\l} Derezi\'nski, Aryan Mokhtari

TL;DR
This paper introduces a stochastic Newton proximal extragradient method that enhances convergence rates for strongly convex optimization, combining fast global linear and superlinear local convergence using noisy Hessian estimates.
Contribution
It extends the HPE framework to achieve improved global and local convergence rates for stochastic second-order methods with noisy Hessian oracles.
Findings
Achieves faster global linear convergence rate.
Reaches superlinear convergence in fewer iterations.
Improves upon previous Hessian averaging methods.
Abstract
Stochastic second-order methods achieve fast local convergence in strongly convex optimization by using noisy Hessian estimates to precondition the gradient. However, these methods typically reach superlinear convergence only when the stochastic Hessian noise diminishes, increasing per-iteration costs over time. Recent work in [arXiv:2204.09266] addressed this with a Hessian averaging scheme that achieves superlinear convergence without higher per-iteration costs. Nonetheless, the method has slow global convergence, requiring up to iterations to reach the superlinear rate of , where is the problem's condition number. In this paper, we propose a novel stochastic Newton proximal extragradient method that improves these bounds, achieving a faster global linear rate and reaching the same fast superlinear rate in …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Iterative Methods for Nonlinear Equations
