MSTGD:A Memory Stochastic sTratified Gradient Descent Method with an Exponential Convergence Rate
Aixiang (Andy) Chen, Jinting Zhang, Zanbo Zhang, Zhihong Li

TL;DR
This paper introduces MSTGD, a novel gradient descent algorithm that leverages memory and stratified sampling to achieve an exponential convergence rate independent of dataset size and parameters.
Contribution
The paper proposes MSTGD, a gradient descent method with variance reduction strategies that guarantees exponential convergence at a constant step size, regardless of dataset size.
Findings
MSTGD achieves exponential convergence rate theoretically proven.
Experimental results confirm the effectiveness of MSTGD.
Convergence rate is independent of dataset size and batch size.
Abstract
The fluctuation effect of gradient expectation and variance caused by parameter update between consecutive iterations is neglected or confusing by current mainstream gradient optimization algorithms.Using this fluctuation effect, combined with the stratified sampling strategy, this paper designs a novel \underline{M}emory \underline{S}tochastic s\underline{T}ratified Gradient Descend(\underline{MST}GD) algorithm with an exponential convergence rate. Specifically, MSTGD uses two strategies for variance reduction: the first strategy is to perform variance reduction according to the proportion p of used historical gradient, which is estimated from the mean and variance of sample gradients before and after iteration, and the other strategy is stratified sampling by category. The statistic \ \ designed under these two strategies can be adaptively unbiased, and its variance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Neural Networks and Applications
