Stochastic Gradient Made Stable: A Manifold Propagation Approach for Large-Scale Optimization
Yadong Mu, Wei Liu, Wei Fan

TL;DR
This paper introduces S3GD, a stratified semi-stochastic gradient descent method that uses manifold propagation to improve stability and efficiency in large-scale convex optimization, outperforming prior algorithms.
Contribution
The paper proposes a novel S3GD algorithm that accelerates large-scale convex optimization by combining manifold propagation with semi-stochastic gradients, reducing complexity and improving convergence.
Findings
S3GD achieves faster convergence than traditional SGD.
The algorithm reduces computational complexity via manifold propagation.
Experimental results on benchmark datasets validate its efficiency.
Abstract
Stochastic gradient descent (SGD) holds as a classical method to build large scale machine learning models over big data. A stochastic gradient is typically calculated from a limited number of samples (known as mini-batch), so it potentially incurs a high variance and causes the estimated parameters bounce around the optimal solution. To improve the stability of stochastic gradient, recent years have witnessed the proposal of several semi-stochastic gradient descent algorithms, which distinguish themselves from standard SGD by incorporating global information into gradient computation. In this paper we contribute a novel stratified semi-stochastic gradient descent (S3GD) algorithm to this nascent research area, accelerating the optimization of a large family of composite convex functions. Though theoretically converging faster, prior semi-stochastic algorithms are found to suffer from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Privacy-Preserving Technologies in Data
MethodsStochastic Gradient Descent
