Stochastic Variance-Reduced Newton: Accelerating Finite-Sum Minimization   with Large Batches

Micha{\l} Derezi\'nski

arXiv:2206.02702·math.OC·April 30, 2025·1 cites

Stochastic Variance-Reduced Newton: Accelerating Finite-Sum Minimization with Large Batches

Micha{\l} Derezi\'nski

PDF

Open Access 1 Repo

TL;DR

This paper introduces SVRN, a stochastic variance-reduced Newton method that significantly accelerates finite-sum minimization, especially for large datasets, by reducing the number of data passes needed compared to existing methods.

Contribution

The paper proposes SVRN, a novel stochastic variance-reduced Newton algorithm that accelerates second-order methods for convex finite-sum problems, with improved complexity bounds and scalability.

Findings

01

SVRN reduces data passes from O(α log(1/ε)) to O(log(1/ε)/log(n))

02

Acceleration effect increases with larger data size n

03

SVRN compares favorably to first-order variance-reduction methods

Abstract

Stochastic variance reduction has proven effective at accelerating first-order algorithms for solving convex finite-sum optimization tasks such as empirical risk minimization. Incorporating second-order information has proven helpful in further improving the performance of these first-order methods. Yet, comparatively little is known about the benefits of using variance reduction to accelerate popular stochastic second-order methods such as Subsampled Newton. To address this, we propose Stochastic Variance-Reduced Newton (SVRN), a finite-sum minimization algorithm that provably accelerates existing stochastic Newton methods from $O (α lo g (1/ ϵ))$ to $O (\frac{l o g ( 1/ ϵ )}{l o g ( n )})$ passes over the data, i.e., by a factor of $O (α lo g (n))$ , where $n$ is the number of sum components and $α$ is the approximation factor in the Hessian estimate. Surprisingly,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

svrnewton/svrn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research