Hessian Averaging in Stochastic Newton Methods Achieves Superlinear Convergence
Sen Na, Michał Dereziński, Michael W. Mahoney

TL;DR
This paper introduces Hessian averaging for stochastic Newton methods: replacing the most recent Hessian estimate with an average of all past estimates reduces the stochastic noise and yields local superlinear convergence without a blow-up in per-iteration cost.
Contribution
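It proposes Hessian averaging schemes that achieve local superlinear convergence in stochastic Newton methods. Existing methods such as Subsampled Newton and Newton Sketch converge superlinearly only if the stochastic noise is gradually reduced to zero, which blows up the per-iteration cost.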
Findings
Hessian averaging achieves local superlinear convergence.
Weighted averaging schemes can accelerate convergence.
Universal weighting schemes transition effectively to local convergence.
Abstract
We consider minimizing a smooth and strongly convex objective function using a stochastic Newton method. At each iteration, the algorithm is given oracle access to a stochastic estimate of the Hessian matrix. The oracle model includes popular algorithms such as Subsampled Newton and Newton Sketch. Despite using second-order information, these existing methods do not exhibit superlinear convergence, unless the stochastic noise is gradually reduced to zero during the iteration, which would lead to a computational blow-up in the per-iteration cost. We propose to address this limitation with Hessian averaging: instead of using the most recent Hessian estimate, our algorithm maintains an average of all the past estimates. This reduces the stochastic noise while avoiding the computational blow-up. We show that this scheme exhibits local Q-superlinear convergence with a non-asymptotic…
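The averaging idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The following are all assumptions made for illustration: the ridge-regularized least-squares objective, the exact gradient, the unit step size, the uniform (equal-weight) running average, the row-subsampling oracle, and the names `averaged_newton`, `grad`, and `hess_oracle`.

```python
import numpy as np

def averaged_newton(grad, hess_oracle, x0, num_iters=30):
    """Newton iteration that uses a running uniform average of all
    past stochastic Hessian estimates (illustrative sketch only)."""
    x = np.asarray(x0, dtype=float).copy()
    H_avg = np.zeros((x.size, x.size))
    for t in range(1, num_iters + 1):
        H_hat = hess_oracle(x)            # fresh noisy Hessian estimate
        H_avg += (H_hat - H_avg) / t      # H_avg = mean of estimates 1..t
        x = x - np.linalg.solve(H_avg, grad(x))  # unit-step Newton update
    return x

# Hypothetical test problem: f(x) = ||Ax - b||^2 / (2n) + (lam / 2) ||x||^2
rng = np.random.default_rng(0)
n, d, m, lam = 5000, 5, 200, 0.1
A = rng.standard_normal((n, d))
b = rng.standard_normal(n)

def grad(x):
    # Exact gradient of the full objective (an assumption of this sketch).
    return A.T @ (A @ x - b) / n + lam * x

def hess_oracle(x):
    # Subsampled Hessian: m rows drawn without replacement, so the cost
    # per call stays fixed instead of growing with the iteration count.
    idx = rng.choice(n, size=m, replace=False)
    A_s = A[idx]
    return A_s.T @ A_s / m + lam * np.eye(d)

x_hat = averaged_newton(grad, hess_oracle, np.zeros(d))
x_star = np.linalg.solve(A.T @ A / n + lam * np.eye(d), A.T @ b / n)
print("gradient norm:", np.linalg.norm(grad(x_hat)))
print("distance to minimizer:", np.linalg.norm(x_hat - x_star))
```

Each call to `hess_oracle` touches only `m` rows, so the per-iteration cost does not grow, while the error of the averaged matrix shrinks as more estimates accumulate. On this quadratic the true Hessian does not depend on `x`, which keeps the example easy to check against the closed-form minimizer `x_star`.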
Taxonomy
Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research
