A Stochastic Variance Reduced Gradient using Barzilai-Borwein Techniques   as Second Order Information

Hardik Tankaria; Nobuo Yamashita

arXiv:2208.11075·math.OC·August 24, 2022

A Stochastic Variance Reduced Gradient using Barzilai-Borwein Techniques as Second Order Information

Hardik Tankaria, Nobuo Yamashita

PDF

Open Access

TL;DR

This paper enhances the stochastic variance reduced gradient (SVRG) method by integrating Barzilai-Borwein curvature information, leading to improved convergence and performance on benchmark datasets.

Contribution

It introduces a novel SVRG variant that incorporates Barzilai-Borwein techniques for variance reduction and second-order information, with proven linear convergence.

Findings

01

Proposed method outperforms existing variance reduced methods on some benchmark problems.

02

The method demonstrates linear convergence under certain conditions.

03

Numerical experiments validate the efficiency of the approach.

Abstract

In this paper, we consider to improve the stochastic variance reduce gradient (SVRG) method via incorporating the curvature information of the objective function. We propose to reduce the variance of stochastic gradients using the computationally efficient Barzilai-Borwein (BB) method by incorporating it into the SVRG. We also incorporate a BB-step size as its variant. We prove its linear convergence theorem that works not only for the proposed method but also for the other existing variants of SVRG with second-order information. We conduct the numerical experiments on the benchmark datasets and show that the proposed method with constant step size performs better than the existing variance reduced methods for some test problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Neural Networks and Applications · Machine Learning and ELM

MethodsTest