A Variance Reduced Stochastic Newton Method
Aurelien Lucchi, Brian McWilliams, Thomas Hofmann

TL;DR
This paper introduces Vite, a stochastic Quasi-Newton method that reduces variance in Hessian approximations, achieving geometric convergence rates and outperforming existing methods in practice.
Contribution
Vite is a novel variance-reduced stochastic Quasi-Newton algorithm that guarantees geometric convergence without relying on specific Hessian forms.
Findings
Vite reaches the optimum at a geometric rate with a constant step-size.
Empirical results show Vite outperforms existing stochastic Quasi-Newton methods.
Vite effectively reduces variance in Hessian approximations, improving convergence.
Abstract
Quasi-Newton methods are widely used in practise for convex loss minimization problems. These methods exhibit good empirical performance on a wide variety of tasks and enjoy super-linear convergence to the optimal solution. For large-scale learning problems, stochastic Quasi-Newton methods have been recently proposed. However, these typically only achieve sub-linear convergence rates and have not been shown to consistently perform well in practice since noisy Hessian approximations can exacerbate the effect of high-variance stochastic gradient estimates. In this work we propose Vite, a novel stochastic Quasi-Newton algorithm that uses an existing first-order technique to reduce this variance. Without exploiting the specific form of the approximate Hessian, we show that Vite reaches the optimum at a geometric rate with a constant step-size when dealing with smooth strongly convex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research
