TL;DR
This paper introduces JacSketch, a variance reduction method for stochastic gradient descent that uses Jacobian sketching to improve convergence rates, unifying and enhancing several existing algorithms.
Contribution
The paper proposes JacSketch, a novel stochastic quasi-gradient method leveraging Jacobian sketching, providing improved convergence analysis and unifying various existing variance reduction techniques.
Findings
JacSketch converges linearly for smooth, strongly convex functions.
The method generalizes and improves upon SAGA and related algorithms.
New rates for minibatch and importance sampling variants are established.
Abstract
We develop a new family of variance reduced stochastic gradient descent methods for minimizing the average of a very large number of smooth functions. Our method --JacSketch-- is motivated by novel developments in randomized numerical linear algebra, and operates by maintaining a stochastic estimate of a Jacobian matrix composed of the gradients of individual functions. In each iteration, JacSketch efficiently updates the Jacobian matrix by first obtaining a random linear measurement of the true Jacobian through (cheap) sketching, and then projecting the previous estimate onto the solution space of a linear matrix equation whose solutions are consistent with the measurement. The Jacobian estimate is then used to compute a variance-reduced unbiased estimator of the gradient. Our strategy is analogous to the way quasi-Newton methods maintain an estimate of the Hessian, and hence our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Stochastic Quasi-Gradient Methods: Variance Reduction via Jacobian Sketching· youtube
