Accelerating Stochastic Recursive and Semi-stochastic Gradient Methods with Adaptive Barzilai-Borwein Step Sizes
Jiangshan Wang, Yiming Yang, Zheng Peng

TL;DR
This paper introduces new adaptive step size rules for stochastic gradient methods, improving convergence and performance through variance reduction, importance sampling, and self-adaptation, with theoretical analysis and extensive experiments.
Contribution
Proposes two novel adaptive step size rules, RHBB and RHBB+, for stochastic gradient methods, enhancing robustness, efficiency, and scalability with theoretical convergence analysis.
Findings
Proposed methods outperform existing algorithms on benchmark datasets.
RHBB+ effectively incorporates importance sampling for better step size adaptation.
The algorithms demonstrate strong scalability and robustness in experiments.
Abstract
The mini-batch versions of StochAstic Recursive grAdient algoritHm and Semi-Stochastic Gradient Descent method, employed the random Barzilai-Borwein step sizes (shorted as MB-SARAH-RBB and mS2GD-RBB), have surged into prominence through timely step size sequence. Inspired by modern adaptors and variance reduction techniques, we propose two new variant rules in the paper, referred to as RHBB and RHBB+, thereby leading to four algorithms MB-SARAH-RHBB, MB-SARAH-RHBB+, mS2GD-RHBB and mS2GD-RHBB+ respectively. RHBB+ is an enhanced version that additionally incorporates the importance sampling technique. They are aggressive in updates, robust in performance and self-adaptive along iterative periods. We analyze the flexible convergence structures and the corresponding complexity bounds in strongly convex cases. Comprehensive tuning guidance is theoretically provided for reference in practical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Face and Expression Recognition
